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HUMAN TRANSPORT-ASSOCIATED MOLECULES 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human 
transport-associated molecules and to the use of these sequences in the diagnosis, 
treatment, and prevention of cancer and transport disorders. 



BACKGROUND OF THE INVENTION 

10 Eukaryotic cells are bound by a lipid bilayer membrane and subdivided into 

functionally distinct, membrane bound compartments. The membranes maintain the 
essential differences between the cytosol, the extracellular environment, and the contents 
of each intercellular organelle. As lipid membranes are highly impermeable to most polar 
molecules transport of essential nutrients, metabolic waste products, cell signaling 

15 molecules, macromolecules and proteins across lipid membranes and between organelles 
must be mediated by a variety of transport-associated molecules. 

Transport of molecules between organelles or vesicles is essential for cell function. 
Eukaryotic proteins are synthesized within the endoplasmic reticulum (ER), delivered 
from the ER to the Golgi complex for post-translational processing and sorting, and 

20 transported from the Golgi to specific intracellular and extracellular destinations. This 
intracellular and extracellular transport of protein molecules is termed vesicle trafficking. 
Trafficking is accomplished by the packaging of protein molecules into specialized 
vesicles which bud from the donor organelle membrane, are transported, and fuse to the 
target membrane. Specialized cell types utilize specific vesicle trafficking routes. For 

25 instance, in endocrine glands, hormones and other secreted proteins are delivered to 
secretory granules for exocytosis through the plasma membrane to the cell exterior. In 
macrophages, peroxidases and proteases are delivered to lysosomes. In fat and muscle 
cells, glucose transporters are stored in vesicles which fuse with the plasma membrane in 
response to insulin stimulation. 

30 Numerous proteins are necessary for the formation, targeting, and fusion of 

transport vesicles and for the proper sorting of proteins into these vesicles. The vesicle 
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trafficking machinery includes coat proteins which promote the budding of vesicles from 
donor membranes; vesicle- and target-specific identifiers (v-SNAREs and t-SNAREs) 
which bind to each other and dock the vesicle to the target membrane; and proteins which 
bind to SNARE complexes and initiate fusion of the vesicle to the target membrane 
5 (SNAPs). 

The Rab family of small GTP-binding proteins are key regulators of vesicle 
transport events. Different Rab proteins are associated with distinct subcellular 
compartments and their vesicular carriers. GTP binding and hydrolysis of Rab proteins 
regulates the specificity and direction of vesicle transport. Over 40 Rab proteins have 
10 been identified in mammalian cells with sequences that share between 35% and 95% 
identity indicating a broad range of functional specificities. Usually, these proteins are 
associated with specific cellular organelles. For example, Rabl is localized to the ER and 
Golgi complex. Rab2 in the transitional ER and the cis Golgi network, Rab3 to secretory 
vesicles, Rab4 to early endosomes, Rab5 to early endosomes and the plasma membrane, 
15 Rab6 to medial and trans Golgi cistemae, Rab7 to late endosomes, and Rab9 to late 
endosomes and the trans Golgi network. In addition, RAB proteins are localized to 
specific tissue types. RAB 1 7 is found in epithelial cells which contain distinct apical, 
basolateral, and transcytotic transport pathways, while Rab3a, Rabl 5, and Rab23 are 
predominantly expressed in the brain and nervous system. (Kuge, O. et al. (1993) J. Cell. 
Biol. 123:1727-1734; Olkkonen, V. M. et al. (1994) Gene 138:207-21 1; Olkkonen, V. M. 
et al. (1997) Int. Rev. Cytol. 176:1-85; and Stahl, B. (1994) J. Biol. Chem. 269:24770- 
24776.) 

The Golgi apparatus lies at the heart of the vesicle transport pathway, and consists 
of a stack of cisternae, together with the cis- and trans-Golgi networks on either face of the 
stack. Molecules are transported through the sequence of compartments that comprise the 
Golgi apparatus by vesicles which bud from one compartment and fuse with the next. 
During mitosis, the Golgi apparatus is disassembled into hundreds of vesicles. These 
fragments can then be partitioned between the two daughter cells. The majority of mitotic 
Golgi fragments are small uniform vesicles, produced by a coatomer I-dependent 
mechanism (COPI) which is responsible for both anterograde and retrograde transport 
through the Golgi stack. There is also a COPI-independent pathway which produces 
larger, more heterogeneous fragments. It is hypothesized that the accumulation of Golgi 
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fragments during mitosis is related to inhibition of membrane fusion. The pi 15 protein 
has been implicated in three Golgi- trafficking steps: 1) intra-Golgi traflBc; 2) endoplasmic 
reticulum to Golgi traffic; and 3) transcytosis. The pi 15 has been shown to be involved in 
a docking step prior to membrane fusion, pi 1 5 binding to Golgi membranes is 

5 specifically inhibited under mitotic conditions as a result of modifications to the 
membranes. An early stage of mitotic Golgi disassembly in vitro can be inhibited by 
increasing pi 15 receptor occupancy with excess pi 15. These data suggest that the mitotic 
inhibition of pi 15 binding contributes to the inhibition of membrane fusion by acting at 
the early step of vesicle docking. (Sapperstein, S. K. et al. (1996) J. Cell Biol. 

10 132:755-767.) 

Vesicles in the process of budding from the ER and the Golgi are covered with a 
protein coat similar to the clathrin coat of endocytotic vesicles. The protein coat is 
assembled from cytosolic precursor molecules and is confined to budding regions of the 
organelle membrane. The coat protein (COP)-coated vesicles are uncoated after budding 

15 is complete to allow fusion of the vesicle to the target membrane. 

The COP coat consists of two major components, a guanosine triphosphatase and 
coat protomer (coatomer). The individual proteins are cytosolic until assembled into the 
protein coat. Coatomer is an equimolar complex of seven proteins, termed alpha-, beta-, 
beta'-, gamma-, delta-, epsilon- and zeta-COP. The coatomer complex binds to dilysine 

20 motifs and reversibly associates with Golgi non-clatherin coated vesicles. These vesicles 
further mediate transport from the ER , via the Golgi, to the trans-Golgi network. 
Coatomer complex is required for budding from the Golgi membranes and is essential for 
retrograde Golgi to ER transport of dilysine motif containing proteins. Polyclonal 
antibody directed to zeta-COP blocks the binding of coatomer to Golgi membranes and 

25 prevents the assembly of COP-coated vesicles on Golgi cistemae. Assay of a conditional 
lethal, temperature-sensitive epsilon-COP mutant of Chinese hamster ovary (CHO) cells 
indicates that epsilon-COP and associated COP coatomers may be necessary for the 
establishment or maintenance of Golgi structure, for proper ER-to-Golgi transport of 
integral membrane and secreted proteins, and for normal endocytotic recycling of low- 

30 density lipoprotein receptors. (Kuge, O. et al. (1993) supra; and Cosson, P. et al. (1996) 
EMBOJ. 15:1792-1798). 

Transport of proteins and RNA between the nucleus and the cytoplasm occurs 

-3- 
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cytoplasmic p53 protein were wild type. In contrast, p53 cDNAs derived from breast 
cancers with nuclear P 53 protein contained a variety of missense mutations and a nonsense 
mutation. It has been suggested that in some breast cancers, the tumor-suppressing 
activity of p53 is inactivated by the sequestration of the protein in the cytoplasm, away 
5 from its site of action in the cell nucleus. Cytoplasmic wild-type p53 was also found in 
human cervical carcinoma cell lines. (Moll, U.M. et al. (1992) Proc. Natl. Acad. Sci. USA 
89:7262-7266; and Liang, X.H. et al. (1993) Oncogene 8:2645-2652.) 

Transport across membranes also depends on transporters, or pumps, membrane- 
spanning proteins which bind to specific classes of molecules and undergo a series of 
10 conformational changes in order to transfer the bound molecule across a membrane. 

Transport by pumps can occur by a passive, concentration-dependant mechanism or can be 
linked to an energy source such as ATP hydrolysis or an ion gradient. Examples include 
facilitative transporters, the secondary active symporters and antiporters driven by ion 
gradients, and active ATP binding cassette transporters involved in multiple-drug 
15 resistance and targeting of antigenic peptides to MHC Class I molecules. Transported 
substrates range from nutrients and ions to a broad variety of drugs, peptides and proteins. 

The energy requirements of most mammalian cells are met through a continuous 
supply of glucose which circulates in the blood. Glucose enters cells through specific 
glucose transporter molecules present in the plasma membrane. The family includes 
20 passive transporters typical of mammalian tissues and active, H(+)-linked sugar 

transporters from bacteria. These transporters characteristically contain two groups of six 
putative membrane-spanning alpha-helices separated by large, hydrophilic, cytoplasmic 
regions. Both the N-terminal and C-terminal regions of the sequence are also predicted to 
be cytoplasmic. Biophysical studies on the human erythrocyte glucose transporter indicate 
25 that the membrane-spanning alpha-helices associate to form a hydrophilic channel or a 
substrate-binding cleft extending across the membrane. The mechanism of substrate 
translocation involves alternate exposure of the substrate-binding site to each face of the 
membrane via a conformational change. (Pessin, J. E. and Bell G. I. (1992) Annu Rev 
Physiol 54:91 1-930.) 

30 Transporters play a major role in the regulation of pH, excretion of drugs, and the 

cellular K+/Na+ balance. Monocarboxylate anion transporters are proton-coupled 
symporters with a broad substrate specificity that includes L-lactate, pyruvate, and the 
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ketone bodies acetate, acetoacetate, and beta-hydroxybutyrate. At least seven isoforms 
have been identified to date. The isoforms are predicted to have twelve transmembrane 
(TM) helical domains with a large intracellular loop between TM6 and TM7, and play a 
critical role in maintaining intracellular pH by removing the protons that are produced 
5 stoichiometrically with lactate during glycolysis. The best characterized 

H(+)-monocarboxylate transporter is that of the erythrocyte membrane, which transports 
L-lactate and a wide range of other aliphatic monocarboxylates. Other cells possess 
H(+)-linked monocarboxylate transporters with differing substrate and inhibitor 
selectivities. In particular, cardiac muscle and tumor cells have transporters that differ in 

10 their K m values for certain substrates, including stereoselectivity for L- over D-lactate, and 
in their sensitivity to inhibitors. There are Na(+)-monocarboxylate cotransporters on the 
luminal surface of intestinal and kidney epithelia, which allow the uptake of lactate, 
pyruvate, and ketone bodies in these tissues. In addition, there are specific and selective 
transporters for organic cations and organic anions in organs including the kidney, 

15 intestine and liver. Organic anion transporters are selective for hydrophobic, charged 
molecules with electron-attracting side groups. Organic cation transporters, such as the 
ammonium transporter, mediate the secretion of a variety of drugs and endogenous 
metabolites, and contribute to the maintainence of intercellular pH. (Poole, R. C. and 
Halestrap, A.P. (1993) Am. J. Physiol. 264:C761-C782; Price, N. T. et al. (1998) 

20 Biochem. J. 329:321-328; and Martinelle, K. and Haggstrom, I. (1993) J. Biotechnol. 30: 
339-350.) 

ATP-binding cassette (ABC) transporters are members of a superfamily of 
membrane proteins that transport small hydrophilic molecules across biological 
membranes. They are comprised of two homologous halves, each containing two parts: a 

25 transmembrane domain with multiple transmembrane segments and a nucleotide binding 
domain. Mammalian ABC transporters are found either as complete transporters, e.g. the 
multiple drug resistance transporter and the cystic fibrosis transmembrane regulator 
proteins, or as half transporters, e.g., the transporters associated with antigen processing; 
TAP1 and TAP2 proteins, which dimerize to form the active TAP transporter. Two half 

30 ABC transporters have been identified in the human peroxisome membrane: the 

adrenoleukodystrophy protein (ALDP) and the 70-kDa peroxisomal membrane protein 
(PMP70). Mutations in the adrenoleukodystrophy gene cause X-linked 
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adrenoleukodystrophy, an inborn error of peroxisomal P-oxidation of very long chain fatty 
acids. Mutations in the PMP70 genes have been found in patients with Zellweger 
syndrome, an inborn error of peroxisome biogenesis. Multidrug resistance (MDR) results 
from overproduction of another member of the ABC transporter family, P-glycoprotein. 
5 MDR is primarily due to increased drug extrusion from the resistant cells by P- 

glycoprotein. The P-glycoproteins have 2 homologous halves, each with 6 hydrophobic 
segments adjacent to a consensus sequence for nucleotide binding. The hydrophobic 
segments may form a membrane channel, whereas the nucleotide binding site may be 
involved in energization of drug transport. (Saurin, W. et al. (1994) Mol. Microbiol. 
10 12:993-1004; Shani, N., et al. (1996) J. Biol. Chem. 271:8725-8730; and Koster, W., and 
Bohm, B. (1992) Mol. & Gen. Genet. 232:399-407.) 

The etiology of numerous human diseases and disorders can be attributed to 
defects in the transport of proteins through membranes, to organelles or the cell surface. 
Defects in the trafficking of membrane-bound transporters and ion channels are associated 
with several disorders, e.g. cystic fibrosis, glucose-galactose malabsorption syndrome, 
hypercholesterolemia, von Gierke disease, and certain forms of diabetes mellitus. 
Abnormal hormonal secretion is linked to disorders, e.g., diabetes insipidus (vasopressin), 
hyper- and hypoglycemia (insulin, glucagon). Grave's disease and goiter (thyroid 
hormone), and Cushing's and Addison's diseases (adrenocorticotropic hormone). Single- 
gene defect diseases resulting in an inability to transport small molecules across 
membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease, 
(van't Hoff, W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G. M. et al. (1994) Ann. 
Intern. Med. 120:218-226; and Chillon, M. et al. (1995) New Eng. J. Med. 
332:1475-1480.) 

Cancer cells secrete excessive amounts of hormones or other biologically active 
peptides. Disorders related to excessive secretion of biologically active peptides by tumor 
cells include, e.g., fasting hypoglycemia due to increased insulin secretion from 
insulinoma-islet cell tumors; hypertension due to increased epinephrine and 
norepinephrine secreted from pheochromocytomas of the adrenal medulla and sympathetic 
paraganglia; and carcinoid syndrome, which includes abdominal cramps, diarrhea, and 
valvular heart disease, caused by excessive amounts of vasoactive substances, e.g., 
serotonin, bradykinin, histamine, prostaglandins, and polypeptide hormones, secreted from 
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intestinal tumors. Ectopic synthesis and secretion of biologically active peptides includes, 
e.g., ACTH and vasopressin in lung and pancreatic cancers; parathyroid hormone in lung 
and bladder cancers; calcitonin in lung and breast cancers; and thyroid-stimulating 
hormone in medullary thyroid carcinoma. (Schwartz, M.Z. (1997) Semin. Pediatr. Surg. 
5 3:141-146; and Said, S.I. and Faloona, G.R. (1975) N. Engl. J. Med. 293:155-160.) 
The discovery of new human transport-associated molecules and the 
polynucleotides encoding it satisfies a need in the art by providing new compositions 
which are useful in the diagnosis, treatment, and prevention of cancer and transport 
disorders. 

10 

SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, human transport- 
associated molecules, referred to collectively as "TRANP" and individually as "TRANP- 
1", "TRANP-2", "TRANP-3", "TRANP-4", "TRANP-5", "TRANP-6", "TRANP-7", 

1 5 "TRANP-8", and "TRANP-9". In one aspect, the invention provides a substantially 
purified polypeptide comprising an amino acid sequence selected from the group 
consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, and fragments thereof. 
The invention further provides a substantially purified variant having at least 90% 

20 amino acid identity to the amino acid sequences of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 
NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and 
SEQ ID NO:9, or to a fragment of any of these sequences. The invention also provides an 
isolated and purified polynucleotide sequence encoding the polypeptide comprising an 
amino acid sequence selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, 

25 SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, and SEQ ID NO:9, and fragments thereof. The invention also includes an isolated 
and purified polynucleotide variant having at least 90% polynucleotide identity to the 
polynucleotide sequence encoding the polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ 

30 ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, 
and fragments thereof. 

The invention further provides an isolated and purified polynucleotide which 

-8- 
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hybridizes under stringent conditions to the polynucleotide sequence encoding the 
polypeptide comprising an amino acid sequence selected from the group consisting of 
SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3 5 SEQ ID NO:4, SEQ ID NO:5, SEQ ID 
NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, and fragments thereof, as well as 
5 an isolated and purified polynucleotide sequence which is complementary to the 

polynucleotide sequence encoding the polypeptide comprising the amino acid sequence 
selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, 
and fragments thereof. 

10 The invention also provides an isolated and purified polynucleotide comprising a 

polynucleotide sequence selected from the group consisting of SEQ ID NO: 1 0, SEQ ID 
NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, 
SEQ ID NO: 1 7, and SEQ ID NO: 1 8 ? and fragments thereof. The invention further 
provides an isolated and purified polynucleotide variant having at least 90% 
polynucleotide identity to the polynucleotide sequence comprising a polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, 
SEQ ID NO: 18, and fragments thereof, as well as an isolated and purified polynucleotide 
which is complementary to the polynucleotide sequence comprising a polynucleotide 
selected from the group consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, and 
SEQ ID NO: 1 8, and fragments thereof 

The invention further provides an expression vector containing at least a fragment 
of the polynucleotide encoding the polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, 
and fragments thereof In another aspect, the expression vector is contained within a host 
cell. 

The invention also provides a method for producing a polypeptide comprising the 
amino acid sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ 
ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, or fragments 
thereof, the method comprising the steps of: (a) culturing the host cell containing an 
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expression vector containing at least a fragment of a polynucleotide sequence encoding 
TRANP under conditions suitable for the expression of the polypeptide; and (b) recovering 
the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composition comprising a 
5 substantially purified polypeptide selected from the group consisting of the amino acid 
sequence of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, 
SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, and fragments thereof in 
conjunction with a suitable pharmaceutical carrier. 

The invention further includes a purified antibody which binds to a polypeptide 

10 comprising the amino acid sequence of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5. SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8 ? or SEQ ID NO:9, 
or fragments thereof, as well as a purified agonist and a purified antagonist to the 
polypeptide. The invention also provides a method for treating or 

preventing a cancer associated with increased expression or activity of TRANP, the 

15 method comprising administering to a subject in need of such treatment an effective 
amount of an antagonist of the polyp eptide selected from the group consisting of SEQ ID 
NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ 
ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, and fragments thereof. 

The invention also provides a method for treating or preventing a transport disorder 

20 associated with increased expression or activity of TRANP, the method comprising 
administering to a subject in need of such treatment an effective amount of an antagonist 
of the polypeptide selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, 
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID 
NO:8, or SEQ ID NO:9, and fragments thereof. 

25 The invention also provides a method for treating or preventing a transport disorder 

associated with decreased expression or activity of TRANP, the method comprising 
administering to a subject in need of such treatment an effective amount of the polypeptide 
selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID NO:3, SEQ 
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or SEQ ID NO:9, 

30 and fragments thereof 

The invention also provides a method for detecting a polynucleotide encoding a 
polypeptide selected from the group consisting of SEQ ID NO:l, SEQ ID NO:2, SEQ ID 

-10- 
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N0:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or 
SEQ ID NO:9, and fragments thereof, in a biological sample containing nucleic acids, the 
method comprising the steps of (a) hybridizing the complement of the polynucleotide 
sequence encoding the polypeptide comprising SEQ ID NO:l, SEQ ID NO:2, SEQ ID 

5 NO:3, SEQ ID NO:4, SEQ ID NO:5 5 SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, or 
SEQ ID NO:9, or fragments thereof to at least one of the nucleic acids of the biological 
sample, thereby forming a hybridization complex; and (b) detecting the hybridization 
complex, wherein the presence of the hybridization complex correlates with the presence 
of a polynucleotide encoding TRANP in the biological sample. In one aspect, the nucleic 

10 acids of the biological sample are amplified by the polymerase chain reaction prior to the 
hybridizing step. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is 

15 understood that this invention is not limited to the particular methodology, protocols, cell 
lines, vectors, and reagents described, as these may vary. It is also to be understood that 
the terminology used herein is for the purpose of describing particular embodiments only, 
and is not intended to limit the scope of the present invention which will be limited only 
by the appended claims. 

20 It must be noted that as used herein and in the appended claims, the singular forms 

"a," "an," and "the" include plural reference unless the context clearly dictates otherwise. 
Thus, for example, a reference to "a host cell" includes a plurality of such host cells, and a 
reference to "an antibody" is a reference to one or more antibodies and equivalents thereof 
known to those skilled in the art, and so forth. 

25 Unless defined otherwise, all technical and scientific terms used herein have the 

same meanings as commonly understood by one of ordinary skill in the art to which this 
invention belongs. Although any methods and materials similar or equivalent to those 
described herein can be used in the practice or testing of the present invention, the 
preferred methods, devices, and materials are now described. All publications mentioned 

30 herein are cited for the purpose of describing and disclosing the cell lines, vectors, and 
methodologies which are reported in the publications and which might be used in 
connection with the invention. Nothing herein is to be construed as an admission that the 
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invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"TRANP," as used herein, refers to the amino acid sequences of substantially 
5 purified TRANP obtained from any species, particularly a mammalian species, including 
bovine, ovine, porcine, murine, equine, and preferably the human species, from any 
source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist," as used herein, refers to a molecule which, when bound to 
TRANP, increases or prolongs the duration of the effect of TRANP. Agonists may 
10 include proteins, nucleic acids, carbohydrates, or any other molecules which bind to and 
modulate the effect of TRANP. 

An "allele" or an "allelic sequence," as these terms are used herein, is an 
alternative form of the gene encoding TRANP. Alleles may result from at least one 
mutation in the nucleic acid sequence and may result in altered mRNAs or in polypeptides 
15 whose structure or function may or may not be altered. Any given natural or recombinant 
gene may have none, one, or many allelic forms. Common mutational changes which give 
rise to alleles are generally ascribed to natural deletions, additions, or substitutions of 
nucleotides. Each of these types of changes may occur alone, or in combination with the 
others, one or more times in a given sequence. 

"Altered" nucleic acid sequences encoding TRANP, as described herein, include 
those sequences with deletions, insertions, or substitutions of different nucleotides, 
resulting in a polynucleotide the same TRANP or a polypeptide with at least one 
functional characteristic of TRANP. Included within this definition are polymorphisms 
which may or may not be readily detectable using a particular oligonucleotide probe of the 
polynucleotide encoding TRANP, and improper or unexpected hybridization to alleles, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence 
encoding TRANP. The encoded protein may also be "altered," and may contain deletions, 
insertions, or substitutions of amino acid residues which produce a silent change and result 
in a functionally equivalent TRANP. Deliberate amino acid substitutions may be made on 
the basis of similarity in polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or 
the amphipathic nature of the residues, as long as the biological or immunological activity 
of TRANP is retained. For example, negatively charged amino acids may include aspartic 
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acid and glutamic acid, positively charged amino acids may include lysine and arginine, 
and amino acids with uncharged polar head groups having similar hydrophilicity values 
may include leucine, isoleucine, and valine; glycine and alanine; asparagine and 
glutamine; serine and threonine; and phenylalanine and tyrosine. 
5 The terms "amino acid" or "amino acid sequence," as used herein, refer to an 

oligopeptide, peptide, polypeptide, or protein sequence, or a fragment of any of these, and 
to naturally occurring or synthetic molecules. In this context, "fragments", "immunogenic 
fragments", or "antigenic fragments" refer to fragments of TRANP which are preferably 
about 5 to about 1 5 amino acids in length and which retain some biological activity or 

10 immunological activity of TRANP. Where "amino acid sequence" is recited herein to 
refer to an amino acid sequence of a naturally occurring protein molecule, "amino acid 
sequence" and like terms are not meant to limit the amino acid sequence to the complete 
native amino acid sequence associated with the recited protein molecule. 

"Amplification," as used herein, relates to the production of additional copies of a 

15 nucleic acid sequence. Amplification is generally carried out using polymerase chain 
reaction (PCR) technologies well known in the art. (See, e.g., Dieffenbach, C.W. and 
G.S. Dveksler (1995) PCR Primer, a Laboratory Manual . Cold Spring Harbor Press, 
Plainview, NY, pp.1-5.) 

The term "antagonist," as it is used herein, refers to a molecule which, when bound 

20 to TRANP, decreases the amount or the duration of the effect of the biological or 
immunological activity of TRANP. Antagonists may include proteins, nucleic acids, 
carbohydrates, antibodies, or any other molecules which decrease the effect of TRANP. 

As used herein, the term "antibody" refers to intact molecules as well as to 
fragments thereof, such as Fa, F(ab') 2 , and Fv fragments, which are capable of binding the 

25 epitopic determinant. Antibodies that bind TRANP polypeptides can be prepared using 
intact polypeptides or using fragments containing small peptides of interest as the 
immunizing antigen. The polypeptide or oligopeptide used to immunize an animal (e.g., a 
mouse, a rat, or a rabbit) can be derived from the translation of RNA, or synthesized 
chemically, and can be conjugated to a carrier protein if desired. Commonly used carriers 

30 that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and 
keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the 
animal. 
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The term "antigenic determinant," as used herein, refers to that fragment of a 
molecule (i.e., an epitope) that makes contact with a particular antibody. When a protein 
or a fragment of a protein is used to immunize a host animal, numerous regions of the 
protein may induce the production of antibodies which bind specifically to antigenic 
5 determinants (given regions or three-dimensional structures on the protein). An antigenic 
determinant may compete with the intact antigen (i.e., the immunogen used to elicit the 
immune response) for binding to an antibody. 

The term "antisense," as used herein, refers to any composition containing a 
nucleic acid sequence which is complementary to a specific nucleic acid sequence. The 
10 term "antisense strand" is used in reference to a nucleic acid strand that is complementary 
to the "sense" strand. Antisense molecules may be produced by any method including 
synthesis or transcription. Once introduced into a cell, the complementary nucleotides 
combine with natural sequences produced by the cell to form duplexes and to block either 
transcription or translation. The designation "negative" can refer to the antisense strand, 
1 5 and the designation "positive" can refer to the sense strand. 

As used herein, the term "biologically active," refers to a protein having structural, 
regulatory, or biochemical functions of a naturally occurring molecule. Likewise, 
"immunologically active" refers to the capability of the natural, recombinant, or synthetic 
TRANP, or of any oligopeptide thereof, to induce a specific immune response in 
20 appropriate animals or cells and to bind with specific antibodies. 

The terms "complementary" or "complementarity," as used herein, refer to the 
natural binding of polynucleotides under permissive salt and temperature conditions by 
base pairing. For example, the sequence "A-G-T" binds to the complementary sequence 
"T-C-A." Complementarity between two single-stranded molecules may be "partial," 
25 such that only some of the nucleic acids bind, or it may be "complete," such that total 
complementarity exists between the single stranded molecules. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency and 
strength of the hybridization between the nucleic acid strands. This is of particular 
importance in amplification reactions, which depend upon binding between nucleic acids 
30 strands, and in the design and use of peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide sequence" or a "composition 
comprising a given amino acid sequence," as these terms are used herein, refer broadly to 
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any composition containing the given polynucleotide or amino acid sequence. The 
composition may comprise a dry formulation, an aqueous solution, or a sterile 
composition. Compositions comprising polynucleotide sequences encoding TRANP or 
fragments of TRANP may be employed as hybridization probes. The probes may be stored 
5 in freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. 
In hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., 
NaCl), detergents (e.g., SDS), and other components (e.g., Denhardt's solution, dry milk, 
salmon sperm DNA, etc.). 

The phrase "consensus sequence," as used herein, refers to a nucleic acid sequence 
10 which has been resequenced to resolve uncalled bases, extended using XL-PCR™ (Perkin 
Elmer, Norwalk, CT) in the 5' and/or the 3 f direction, and resequenced, or which has been 
assembled from the overlapping sequences of more than one Incyte Clone using a 
computer program for fragment assembly, such as the GEL VIEW™ Fragment Assembly 
system (GCG, Madison, WI). Some sequences have been both extended and assembled to 
15 produce the consensus sequence . 

As used herein, the term "correlates with expression of a polynucleotide" indicates 
that the detection of the presence of nucleic acids, the same or related to a nucleic acid 
sequence encoding TRANP, by northern analysis is indicative of the presence of nucleic 
acids encoding TRANP in a sample, and thereby correlates with expression of the 
transcript from the polynucleotide encoding TRANP. 

A "deletion," as the term is used herein, refers to a change in the amino acid or 
nucleotide sequence that results in the absence of one or more amino acid residues or 
nucleotides. 

The term "derivative," as used herein, refers to the chemical modification of 
TRANP, of a polynucleotide sequence encoding TRANP, or of a polynucleotide sequence 
complementary to a polynucleotide sequence encoding TRANP. Chemical modifications 
of a polynucleotide sequence can include, for example, replacement of hydrogen by an 
alkyl, acyl, or amino group. A derivative polynucleotide encodes a polypeptide which 
retains at least one biological or immunological function of the natural molecule. A 
derivative polypeptide is one modified by glycosylation, pegylation, or any similar process 
that retains a at least one biological or immunological function of the polypeptide from 
which it was derived. 
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The term "homology," as used herein, refers to a degree of complementarity. 
There may be partial homology or complete homology. The word "identity" may 
substitute for the word "homology." A partially complementary sequence that at least 
partially inhibits an identical sequence from hybridizing to a target nucleic acid is referred 
5 to as "substantially homologous." The inhibition of hybridization of the completely 
complementary sequence to the target sequence may be examined using a hybridization 
assay (Southern or northern blot, solution hybridization, and the like) under conditions of 
reduced stringency. A substantially homologous sequence or hybridization probe will 
compete for and inhibit the binding of a completely homologous sequence to the target 

10 sequence under conditions of reduced stringency. This is not to say that conditions of 
reduced stringency are such that non-specific binding is permitted, as reduced stringency 
conditions require that the binding of two sequences to one another be a specific (i.e., a 
selective) interaction. The absence of non-specific binding may be tested by the use of a 
second target sequence which lacks even a partial degree of complementarity (e.g., less 

15 than about 30% homology or identity). In the absence of non-specific binding, the 
substantially homologous sequence or probe will not hybridize to the second non- 
complementary target sequence. 

The phrases "percent identity" or "% identity" refer to the percentage of sequence 
similarity found in a comparison of two or more amino acid or nucleic acid sequences. 

20 Percent identity can be determined electronically, e.g., by using the MegAlign program 
(DNASTAR, Inc., Madison WI). The MegAlign program can create alignments between 
two or more sequences according to different methods, e.g., the clustal method. (Higgins, 
D.G. and P. M. Sharp (1988) Gene 73:237-244.) The clustal algorithm groups sequences 
into clusters by examining the distances between all pairs. The clusters are aligned 

25 pairwise and then in groups. The percentage similarity between two amino acid 
sequences, e.g., sequence A and sequence B, is calculated by dividing the length of 
sequence A, minus the number of gap residues in sequence A, minus the number of gap 
residues in sequence B, into the sum of the residue matches between sequence A and 
sequence B, times one hundred. Gaps of low or of no homology between the two amino 

30 acid sequences are not included in determining percentage similarity. Percent identity 
between nucleic acid sequences can also be calculated or counted by the clustal method, or 
by other methods known in the art. such as the Jotun Hein Method. (See, e.g., Hein, J. 
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(1990) Methods in Enzymology 183:626-645.) Identity between sequences can also be 
determined by other methods known in the art, e.g., by varying hybridization conditions. 

"Human artificial chromosomes" (HACs), as described herein, are linear 
microchromosomes which may contain DNA sequences of about 6 kb to 10 Mb in size, 
5 and which contain all of the elements required for stable mitotic chromosome segregation 
and maintenance. (See, e.g., Harrington, JJ. et al. (1997) Nat Genet. 15:345-355.) 

The term "humanized antibody," as used herein, refers to antibody molecules in 
which the amino acid sequence in the non-antigen binding regions has been altered so that 
the antibody more closely resembles a human antibody, and still retains its original 
10 binding ability. 

"Hybridization," as the term is used herein, refers to any process by which a strand 
of nucleic acid binds with a complementary strand through base pairing. 

As used herein, the term "hybridization complex" as used herein, refers to a 
complex formed between two nucleic acid sequences by virtue of the formation of 

15 hydrogen bonds between complementary bases. A hybridization complex may be formed 
in solution (e.g., C 0 t or R^t analysis) or formed between one nucleic acid sequence present 
in solution and another nucleic acid sequence immobilized on a solid support (e.g., paper, 
membranes, filters, chips, pins or glass slides, or any other appropriate substrate to which 
cells or their nucleic acids have been fixed). 

20 The words "insertion" or "addition," as used herein, refer to changes in an amino 

acid or nucleotide sequence resulting in the addition of one or more amino acid residues or 
nucleotides, respectively, to the sequence found in the naturally occurring molecule. 

"Immune response" can refer to conditions associated with inflammation, trauma, 
immune disorders, or infectious or genetic disease, etc. These conditions can be 

25 characterized by expression of various factors, e.g., cytokines, chemokines, and other 
signaling molecules, which may affect cellular and systemic defense systems. 

The term "microarray," as used herein, refers to an array of distinct polynucleotides 
or oligonucleotides arrayed on a substrate, such as paper, nylon or any other type of 
membrane, filter, chip, glass slide, or any other suitable solid support. 

30 The term "modulate," as it appears herein, refers to a change in the activity of 

TRANP. For example, modulation may cause an increase or a decrease in protein activity, 
binding characteristics, or any other biological, functional, or immunological properties of 
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TRANP. 

The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refer to an 
oligonucleotide, nucleotide, polynucleotide, or any fragment thereof, to DNA or RNA of 
genomic or synthetic origin which may be single-stranded or double-stranded and may 
5 represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any DNA- 
like or RNA-like material. In this context, "fragments" refers to those nucleic acid 
sequences which are greater than about 60 nucleotides in length, and most preferably are at 
least about 100 nucleotides, at least about 1000 nucleotides, or at least about 10,000 
nucleotides in length. 

10 The terms "operably associated" or "operably linked," as used herein, refer to 

functionally related nucleic acid sequences. A promoter is operably associated or operably 
linked with a coding sequence if the promoter controls the transcription of the encoded 
polypeptide. While operably associated or operably linked nucleic acid sequences can be 
contiguous and in reading frame, certain genetic elements, e.g., repressor genes, are not 

15 contiguously linked to the encoded polypeptide but still bind to operator sequences that 
control expression of the polypeptide. 

The term "oligonucleotide," as used herein, refers to a nucleic acid sequence of at 
least about 6 nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and 
most preferably about 20 to 25 nucleotides, which can be used in PCR amplification or in 

20 a hybridization assay or microarray. As used herein, the term "oligonucleotide" is 

substantially equivalent to the terms "ampiimers," "primers," "oligomers," and "probes," 
as these terms are commonly defined in the art. 

"Peptide nucleic acid" (PNA), as used herein, refers to an antisense molecule or 
anti-gene agent which comprises an oligonucleotide of at least about 5 nucleotides in 

25 length linked to a peptide backbone of amino acid residues ending in lysine. The terminal 
lysine confers solubility to the composition. PNAs preferentially bind complementary 
single stranded DNA and RNA and stop transcript elongation, and may be pegylated to 
extend their lifespan in the cell. (See, e.g., Nielsen, P.E. et al. (1993) Anticancer Drug 
Des. 8:53-63.) 

30 The term "sample," as used herein, is used in its broadest sense. A biological 

sample suspected of containing nucleic acids encoding TRANP, or fragments thereof, or 
TRANP itself may comprise a bodily fluid; an extract from a cell, chromosome, organelle, 
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or membrane isolated from a cell; a cell; genomic DNA, RNA, or cDNA, in solution or 
bound to a solid support; a tissue; a tissue print; etc. 

As used herein, the terms "specific binding" or "specifically binding" refer to that 
interaction between a protein or peptide and an agonist, an antibody, or an antagonist. The 
5 interaction is dependent upon the presence of a particular structure of the protein 
recognized by the binding molecule (i.e., the antigenic determinant or epitope). For 
example, if an antibody is specific for epitope "A," the presence of a polypeptide 
containing the epitope A, or the presence of free unlabeled A, in a reaction containing free 
labeled A and the antibody will reduce the amount of labeled A that binds to the antibody. 

10 As used herein, the term "stringent conditions" refers to conditions which permit 

hybridization between polynucleotide sequences and the claimed polynucleotide 
sequences. Suitably stringent conditions can be defined by, for example, the 
concentrations of salt or formamide in the prehybridization and hybridization solutions, or 
by the hybridization temperature, and are well known in the art. In particular, stringency 

15 can be increased by reducing the concentration of salt, increasing the concentration of 
formamide, or raising the hybridization temperature. 

For example, hybridization under high stringency conditions could occur in about 
50% formamide at about 37°C to 42°C. Hybridization could occur under reduced 
stringency conditions in about 35% to 25% formamide at about 30°C to 35°C. In 

20 particular, hybridization could occur under high stringency conditions at 42°C in 50% 
formamide, 5X SSPE, 0.3% SDS, and 200 jug/ml sheared and denatured salmon sperm 
DNA. Hybridization could occur under reduced stringency conditions as described above, 
but in 35% formamide at a reduced temperature of 35°C. The temperature range 
corresponding to a particular level of stringency can be further narrowed by calculating the 

25 purine to pyrimidine ratio of the nucleic acid of interest and adjusting the temperature 
accordingly. Variations on the above ranges and conditions are well known in the art. 

The term "substantially purified," as used herein, refers to nucleic acid or amino 
acid sequences that are removed from their natural environment and are isolated or 
separated, and are at least about 60% free, preferably about 75% free, and most preferably 

30 about 90% free from other components with which they are naturally associated. 

A "substitution," as used herein, refers to the replacement of one or more amino 
acids or nucleotides by different amino acids or nucleotides, respectively. 
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"Transformation," as defined herein, describes a process by which exogenous DNA 
enters and changes a recipient cell. Transformation may occur under natural or artificial 
conditions according to various methods well known in the art, and may rely on any 
known method for the insertion of foreign nucleic acid sequences into a prokaryotic or 

5 eukaryotic host cell. The method for transformation is selected based on the type of host 
cell being transformed and may include, but is not limited to, viral infection, 
electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed" cells includes stably transformed cells in which the inserted DNA is capable 
of replication either as an autonomously replicating plasmid or as part of the host 

10 chromosome, and refers to cells which transiently express the inserted DNA or RNA for 
limited periods of time. 

A "variant" of TRANP, as used herein, refers to an amino acid sequence that is 
altered by one or more amino acids. The variant may have "conservative" changes, 
wherein a substituted amino acid has similar structural or chemical properties (e.g., 

15 replacement of leucine with isoleucine). More rarely, a variant may have 

"nonconservative" changes (e.g., replacement of glycine witl tryptophan). Analogous 
minor variations may also include amino acid deletions or insertions, or both. Guidance in 
determining which amino acid residues may be substituted, inserted, or deleted without 
abolishing biological or immunological activity may be found using computer programs 

20 well known in the art, for example, DNASTAR software. 

THE INVENTION 

The invention is based on the discovery of new human transport-associated 
molecules (TRANP), the polynucleotides encoding TRANP, and the use of these 
25 compositions for the diagnosis, treatment, or prevention of cancer and transport disorders. 
Table 1 shows the sequence identification numbers, Incyte Clone identification number, 
and cDNA library for each of the human transport-associated molecules disclosed herein. 

Table 1 

30 



PROTEIN 


NUCLEOTIDE 


CLONE ID 


LIBRARY 


SEQ IDNO:l 


SEQ ID NO: 10 


144861 


TLYMNOR01 
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»EQ ID NO: 1 1 


607812 


COLNNOT01 


SEQ IDNO:3 


SEQ ID NO: 12 


1259384 


MtlNl 1 U 1 03 


SEQ ID NO:4 


SEQ ID NO: 13 


1340813 


COLNTUT03 


SEQ ID NO:5 


SEQ ID NO: 14 


1689731 


PROSTUT10 


SEQ ID NO:6 


SEQ ID NO: 15 


2751730 


THPIAZS08 


SEQ ID NO:7 


SEQ ID NO: 16 


2794975 


NPOLNOT01 


SEQ ID NO:8 


SEQ ID NO: 17 


2797710 


NPOLNOT01 


SEQIDNO:9 


SEQ ID NO: 18 


2914719 


THYMFET03 



10 Nucleic acids encoding the TRANP-1 of the present invention were first identified 

in Incyte Clone 144861 from the mononuclear cell cDNA library (TLYMNOR01) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 10, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 144861 (TLYMNOR01) and 156223 (THP1PLB02). 
1 5 In one embodiment, the invention encompasses a polypeptide comprising the 

amino acid sequence of SEQ ID NO: 1 . TRANP-1 is 383 amino acids in length and has 
two potential sugar transport protein signatures from residues L„ to R« 7 and from residues 
A 230 to S 246 . TRANP-1 has chemical and structural homology with a sugar transporter 
from sugar beet (GI 1209756). In particular, TRANP and GI 1209756; share 30% identity. 
20 The fragment of SEQ ID NO: 1 0 from about nucleotide 442 to nucleotide 483 is useful for 
hybridization. Northern analysis shows the expression of this sequence in 
hematopoietic/immune, cardiovascular, and urologic cDNA libraries. Approximately 60% 
of these libraries are associated with fetal tissues and proliferating cell lines and 40% with 
inflammation. 

Nucleic acids encoding the TRANP-2 of the present invention were first identified 
in Incyte Clone 607812 from the colon tissue cDNA library (COLNNOT01) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO:l 1 , was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 607812 (COLNNOT01), 31761 17 (UTRSTUT04), and the 
30 shotgun sequences SAEA02352 and SAEC1 0448. 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:2. TRANP-2 is 272 amino acids in length and has a 
potential N-glycosolation site at residue N*. a potential casern kinase II phosphorylation 



25 
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site at residue T 48 , four potential protein kinase C phosphorylation sites at residues T 158 , 
Sim, T 25! , and S 365 , and a leucine zipper pattern from residue L 62 to residue L 83 . TRANP-2 
has chemical and structural homology with the C elegans ammonium transporter (GI 
1 125753). In particular, TRANP-2 and GI 1 125753 share 47% identity. The fragment of 
5 SEQ ID NO: 1 1 from about nucleotide 25 1 to nucleotide 274 is useful for hybridization. 
Northern analysis shows the expression of this sequence in reproductive, cardiovascular, 
gastrointestinal, neural, and urologic cDNA libraries. Approximately 67% of these 
libraries are associated with cancer, and 28% with fetal tissues and proliferating cell lines. 
Nucleic acids encoding the TRANP-3 of the present invention were first identified 

1 0 in Incyte Clone 1 259384 from the meningioma cDNA library (MENITUT03) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 12, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 1259384 (MENITUT03)and 2946062 (BRAITUT23). 

In one embodiment, the invention encompasses a polypeptide comprising the 

15 amino acid sequence of SEQ ID NO:3. TRANP-3 is 210 amino acids in length and has a 
potential amidation site at residue D 55 , a potential N-glycosolation site at residue N 83 , five 
potential casein kinase II phosphorylation sites at residues S 70 , S 86 , T 88 , S H5 , and S 199 , four 
potential protein kinase C phosphorylation sites at residues T 45 , S TO , T 85 , and S l99 , and two 
potential tyrosine phosphorylation sites at residues Y44 and Y, n . TRANP-3 has chemical 

20 and structural homology with the zeta-COP coatomer protein from cow (GI 441486). In 
particular, TRANP-3 and GI 441486 share 63% identity. The fragment of SEQ ID NO: 12 
from about nucleotide 63 to nucleotide 125 is useful for hybridization. Northern analysis 
shows the expression of this sequence in reproductive, neural, cardiovascular, 
gastrointestinal, developmental, endocrine, musculoskeletal, and hematopoietic/immune 

25 cDNA libraries. Approximately 59% of these libraries are associated with cancer, and 
1 8% with fetal tissues and proliferating cell lines. 

Nucleic acids encoding the TRANP-4 of the present invention were first identified 
in Incyte Clone 1340813 from the colon tumor cDNA library (COLNTUT03) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 

30 NO: 13 , was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 1340813 and 1343253 (COLNTUT03), 1515390 
(PANCTUT01), 7581 10 (BRAITUT02), 1573508 (LNODNOT03), 1855515 ( 
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HNT3AZT01), and 1971226 (UCMCL5T01). 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:4. TRANP-4 is 465 amino acids in length and has 
chemical and structural homology with the rat monocarboxylate transporter (GI 2463651). 
In particular, TRANP-4 and GI 246365 1 share 90% identity. The fragment of SEQ ID 
NO: 13 from about nucleotide 674 to nucleotide 694 is useful for hybridization. Northern 
analysis shows the expression of this sequence in hematopoietic/immune, gastrointestinal, 
reproductive, cardiovascular, neural, musculoskeletal, developmental, urologic, and 
dermatologic cDNA libraries. Approximately 41% of these libraries are associated with 
cancer, 32% with inflammation, and 23% with fetal tissues and proliferating cell lines. 

Nucleic acids encoding the TRANP-5 of the present invention were first identified 
in Incyie Clone 1689731from the prostate tumor cDNA library (PROSTUT10) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 14, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 1689731 (PROSTUT10), 1572293 (UTRSNOT05), 1807228 
(SINTNOT13), 1556505 (BLADTUT04), 1295734 (PGANNOT03) 1721666 
(BLADNOT06), and the shotgun sequences SAEA02030, SAEA00859, and SAEA02913. 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:5. TRANP-5 is 237 amino acids in length and has a 
potential ATP/GTP-binding site motif from residues G l6 to S 23 , and a potential prenylation 
site from residues C 23 « to P m TRANP-5 has chemical and structural homology with 
mouse rab23 (GI 438162). In particular, TRANP-5 and GI 438 1 62 share 93% identity. 
The fragment of SEQ ID NO: 14 from about nucleotide 721 to nucleotide 759 is useful for 
hybridization. Northern analysis shows the expression of this sequence in reproductive, 
cardiovascular, neural, gastrointestinal, urologic, developmental, endocrine, and 
dermatological cDNA libraries. Approximately 53% of these libraries are associated with 
cancer, 13% with inflammation, and 13% with fetal tissues and proliferating cell lines. 

Nucleic acids encoding the TRANP-6 of the present invention were first identified 
in Incyte Clone 2751730 from the mononuclear cell cDNA library (THP1AZS08) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 15, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 2751730 and 2754090 (THP1AZS08), 2174086 (ENDCNOT03), 
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and 342996 (BRSTNOR01). 

In one embodiment, the invention encompasses a polypeptide comprising the 
amino acid sequence of SEQ ID NO:6. TRANP-6 is 208 amino acids in length and has a 
potential ATP/GTP -binding site motif from residues G 20 to TV? . TRANP-6 has chemical 
5 and structural homology with human GTP binding protein (GI 550072). In particular, 
TRANP-6 and GI 550072 share 90% identity. The fragment of SEQ ID NO: 15 from about 
nucleotide 551 to nucleotide 577 is usefiil for hybridization. Northern analysis shows the 
expression of this sequence in neural, reproductive, hematopoietic/immune, 
developmental, and dermatologic cDNA libraries. Approximately 43% of these libraries 

10 are associated with fetal tissues and proliferating cell lines, and 28% with cancer. 

Nucleic acids encoding the TRANP-7 of the present invention were first identified 
in Incyte Clone 2794975 from the nasal polyp tissue cDNA library (NPOLNOT01) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 16, was derived from the following overlapping and/or extended nucleic acid 

15 sequences: Incyte Clones 2794975 (NPOLNOT01), 3722893 (BRSTNOT23), 2922316 
(SININOT04), 2624003 (KERANOT02), 1794517 (THYRNOT03), 1416892 
(BRAINOT12), 1418058 (KIDNNOT09), 008439 (HMC1NOT01), 1873165 
(LEUKNOT02), and 1370761 (BSTMNON02). 

In one embodiment, the invention encompasses a polypeptide comprising the 

20 amino acid sequence of SEQ ID NO:7. TRANP-7 is 709 amino acids in length and has 
two potential ABC transporter family signatures from residues F 328 to L 339 and from 
residues L 6I1 toT 625 , and two potential ATP/GTP- binding site motifs from residues G 2I0 
toT 217 and from residues G 525 to S 532 . TRANP-7 has chemical and structural homology 
with a CL elepans putative ATP-binding transport protein (GI 500734). In particular, 

25 TRANP-7 and GI 500734 share 47% identity. The fragment of SEQ ID NO: 1 6 from about 
nucleotide 1 92 to nucleotide 227 is useful for hybridization. Northern analysis shows the 
expression of this sequence in neural, reproductive, hematopoietic/immune, 
gastrointestinal, and cardiovascular cDNA libraries. Approximately 38% of these libraries 
are associated with cancer, 24% with inflammation, and 22% with fetal tissues and 

30 proliferating cell lines. 

Nucleic acids encoding the TRANP-8 of the present invention were first identified 
in Incyte Clone 2797710 from the nasal polyp tissue cDNA library (NPOLNOT01) using a 
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computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
NO: 17, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 2797710 (NPOLNOT01), 879491 (THYRNOT02), 234596 
(SINTNOT02), 251 188 (PANCDITOI), 036294 (HUVENOB01), 2939073 
5 (THYMFET02), 2019482 (CONNNOT01), 2171995 (ENDCNOT03), 517825 
(MMLR1DT01), 1714811 (UCMCNOT02), 371259 (LUNGNOT02), 1995866 
(BRSTTUT03), 2642178 (LUNGTUT08), and the shotgun sequences SADA01064 and 
SADA00379. 

In one embodiment, the invention encompasses a polypeptide comprising the 
10 amino acid sequence of SEQ ID NO:8. TRANP-8 is 962 amino acids in length and has 
five potential N-glycosylation sites at residues N I0I , N I23 , N 243 , N 45l , and N tt2 , one potential 
tyrosine kinase phosphorylation site at residue Y 81> eighteen potential casein kinase II 
phosphorylation sites at residues T l8 , T*, T 74 , S 91 , S 129> T 336 , T 410 , T 453 , S 585 , S 63I , S 632 , S 7I7 , 
T ?54. S 75g , S 780 , Tg^, T 890 , and S^. TRANP-8 has chemical and structural homology with 
15 the rat pi 15 vesicular transport factor (GI 538153). In particular, TRANP-8 and 
GI 538153 share 93% identity. The fragment of SEQ ID NO: 17 from about nucleotide 
2688 to nucleotide 2723 is useful for hybridization. Northern analysis shows the 
expression of this sequence in reproductive, neural, gastrointestinal, 
hematopoietic/immune, and cardiovascular cDNA libraries. Approximately 45% of these 
20 libraries are associated with cancer, 30% with inflammation, and 1 7% with fetal tissues 
and proliferating cell lines. 

Nucleic acids encoding the TRANP-9 of the present invention were first identified 
in Incyte Clone 2914719 from the fetal thymus cDNA library (THYMFET03) using a 
computer search for amino acid sequence alignments. A consensus sequence, SEQ ID 
25 NO: 1 8, was derived from the following overlapping and/or extended nucleic acid 
sequences: Incyte Clones 2914719 (THYMFET03), 264268 (HNT2AGT01), 1553167 
(BLADTUT04), 550095 (BEPINOT01), 2148915 (BRAINOT09), and 588157 
(UTRSNOT01). 

In one embodiment, the invention encompasses a polypeptide comprising the 
30 amino acid sequence of SEQ ID NO:9. TRANP-9 is 368 amino acids in length and has 
two potential beta transducin family Trp-Asp (WD-40) repeats from residues V 101 to L 115 
and from residues V l44 to T 158 . TRANP-9 has chemical and structural homology with £. 
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elegans polyA RNA export protein (GI 1546734). In particular, TRANP-9 and 
GI 1546734 share 47% identity. The fragment of SEQ ID NO: 18 from about nucleotide 
203 to nucleotide 265 is useful for hybridization. Northern analysis shows the expression 
of this sequence in reproductive, neural, hematopoietic/immune, and gastrointestinal 
5 cDNA libraries. Approximately 39% of these libraries are associated with cancer, 33% 
with inflammation, and 23% with fetal tissues and proliferating cell lines. 

The invention also encompasses TRANP variants. A preferred TRANP variant is 
one which has at least about 80%, more preferably at least about 90%, and most preferably 
at least about 95% amino acid sequence identity to the TRANP amino acid sequence, and 
1 0 which contains at least one functional or structural characteristic of TRANP. 

The invention also encompasses polynucleotides which encode TRANP. In a 
particular embodiment, the invention encompasses a polynucleotide consisting of a nucleic 
acid sequence selected from group consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
NO:12, SEQ ID NO: 13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
15 and SEQ ID NO: 18. 

The invention also encompasses a variant of a polynucleotide sequence encoding 
TRANP. In particular, such a variant polynucleotide sequence will have at least about 
80%, more preferably at least about 90%, and most preferably at least about 95% 
polynucleotide sequence identity to the polynucleotide sequence encoding TRANP. A 
20 particular aspect of the invention encompasses a variant of a polynucleotide sequence 
selected from the group consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, 
SEQ ID NO:13, SEQ ID NO: 14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, and 
SEQ ID NO: 18, which has at least about 80%, more preferably at least about 90%, and 
most preferably at least about 95% polynucleotide sequence identity to the polynucleotide 
25 sequence encoding TRANP. Any one of the polynucleotide variants described above can 
encode an amino acid sequence which contains at least one functional or structural 
characteristic of TRANP. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of 
the genetic code, a multitude of polynucleotide sequences encoding TRANP, some bearing 
30 minimal homology to the polynucleotide sequences of any known and naturally occurring 
gene, may be produced. Thus, the invention contemplates each and every possible 
variation of polynucleotide sequence that could be made by selecting combinations based 
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on possible codon choices. These combinations are made in accordance with the standard 
triplet genetic code as applied to the polynucleotide sequence of naturally occurring 
TRANP, and all such variations are to be considered as being specifically disclosed. 
Although nucleotide sequences which encode TRANP and its variants are 
5 preferably capable of hybridizing to the nucleotide sequence of the naturally occurring 
TRANP under appropriately selected conditions of stringency, it may be advantageous to 
produce nucleotide sequences encoding TRANP or its derivatives possessing a 
substantially different codon usage. Codons may be selected to increase the rate at which 
expression of the peptide occurs in a particular prokaryotic or eukaryotic host in 

10 accordance with the frequency with which particular codons are utilized by the host. 
Other reasons for substantially altering the nucleotide sequence encoding TRANP and its 
derivatives without altering the encoded amino acid sequences include the production of 
RNA transcripts having more desirable properties, such as a greater half-life, than 
transcripts produced from the naturally occurring sequence. 

1 5 The invention also encompasses production of DNA sequences which encode 

TRANP and TRANP derivatives, or fragments thereof, entirely by synthetic chemistry. 
After production, the synthetic sequence may be inserted into any of the many available 
expression vectors and cell systems using reagents that are well known in the art. 
Moreover, synthetic chemistry may be used to introduce mutations into a sequence 

20 encoding TRANP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable 
of hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown 
in SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID NO.T3, SEQ ID NO: 14, SEQ 
ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, or SEQ ID NO: 18, and fragments thereof, 

25 under various conditions of stringency. (See, e.g., Wahl, G.M. and S.L. Berger (1987) 
Methods Enzymol. 152:399-407; and Kimmel, A.R. (1987) Methods Enzymol. 152:507- 
511.) 

Methods for DNA sequencing are well known and generally available in the art 
and may be used to practice any of the embodiments of the invention. The methods may 
30 employ such enzymes as the Klenow fragment of DNA polymerase I, Sequenase® (US 
Biochemical Corp., Cleveland, OH), Taq polymerase (Perkin Elmer), thermostable T7 
polymerase (Amersham, Chicago, IL), or combinations of polymerases and proofreading 
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exonucleases such as those found in the ELONGASE Amplification System (GlBCO/BRL, 
Gaithersburg, MD). Preferably, the process is automated with machines such as the 
Hamilton Micro Lab 2200 (Hamilton, Reno, NV), Peltier Thermal Cycler (PTC200; MJ 
Research, Watertown, MA) and the ABI Catalyst and 373 and 377 DNA Sequencers 
5 (Perkin Elmer). 

The nucleic acid sequences encoding TRANP may be extended utilizing a partial 
nucleotide sequence and employing various methods known in the art to detect upstream 
sequences, such as promoters and regulatory elements. For example, one method which 
may be employed, restriction-site PCR, uses universal primers to retrieve unknown 

10 sequence adjacent to a known locus. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 
2:3 1 8-322.) In particular, genomic DNA is first amplified in the presence of a primer 
complementary to a linker sequence within the vector and a primer specific to the region 
predicted to encode the gene. The amplified sequences are then subjected to a second 
round of PCR with the same linker primer and another specific primer internal to the first 

15 one. Products of each round of PCR are transcribed with an appropriate RNA polymerase 
and sequenced using reve .se transcriptase. 

Inverse PCR may also be used to amplify or extend sequences using divergent 
primers based on a known region. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 
16:8186.) The primers may be designed using commercially available software such as 

20 OLIGO 4.06 Primer Analysis software (National Biosciences Inc., Plymouth, MN) or 
another appropriate program to be about 22 to 30 nucleotides in length, to have a GC 
content of about 50% or more, and to anneal to the target sequence at temperatures of 
about 68°C to 72°C. The method uses several restriction enzymes to generate a suitable 
fragment in the known region of a gene. The fragment is then circularized by 

25 intramolecular ligation and used as a PCR template. 

Another method which may be used is capture PCR, which involves PCR 
amplification of DNA fragments adjacent to a known sequence in human and yeast 
artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods 
Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and ligations 

30 may be used to place an engineered double-stranded sequence into an unknown fragment 
of the DNA molecule before performing PCR. Other methods which may be used to 
retrieve unknown sequences are known in the art (See, e.g., Parker, J.D. etal. (1991) 
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Nucleic Acids Res. 19:3055-3060.) Additionally, one may use PCR, nested primers, and 
PromoterFinder™ libraries to walk genomic DNA (Clontech, Palo Alto, CA). This 
process avoids the need to screen libraries and is useful in finding intron/exon junctions. 
When screening for full-length cDNAs, it is preferable to use libraries that have 
5 been size-selected to include larger cDNAs. Also, random-primed libraries are preferable 
in that they will include more sequences which contain the 5 1 regions of genes. Use of a 
randomly primed library may be especially preferable for situations in which an oligo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension 
of sequence into 5 l non-transcribed regulatory regions. 

10 Capillary electrophoresis systems which are commercially available may be used 

to analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In 
particular, capillary sequencing may employ flowable polymers for electrophoretic 
separation, four different fluorescent dyes (one for each nucleotide) which are laser 
activated, and a charge coupled device camera for detection of the emitted wavelengths. 

15 Output/light intensity may be converted to electrical signal using appropriate software 
(e.g., Genotyper™ and Sequence Navigator™, Perkin Elmer), and the entire process from 
loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for the sequencing of small 
pieces of DNA which might be present in limited amounts in a particular sample. 

20 In another embodiment of the invention, polynucleotide sequences or fragments 

thereof which encode TRANP may be used in recombinant DNA molecules to direct 
expression of TRANP, or fragments or functional equivalents thereof, in appropriate host 
cells. Due to the inherent degeneracy of the genetic code, other DNA sequences which 
encode substantially the same or a functionally equivalent amino acid sequence may be 

25 produced, and these sequences may be used to clone and express TRANP. 

As will be understood by those of skill in the art, it may be advantageous to 
produce TRANP-encoding nucleotide sequences possessing non-naturally occuning 
codons. For example, codons preferred by a particular prokaryotic or eukaryotic host can 
be selected to increase the rate of protein expression or to produce an RNA transcript 

30 having desirable properties, such as a half-life which is longer than that of a transcript 
generated from the naturally occurring sequence. 

The nucleotide sequences of the present invention can be engineered using 
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methods generally known in the art in order to alter TRANP-encoding sequences for a 
variety of reasons including, but not limited to, alterations which modify the cloning, 
processing, and/or expression of the gene product. DNA shuffling by random 
fragmentation and PCR reassembly of gene fragments and synthetic oligonucleotides may 
5 be used to engineer the nucleotide sequences. For example, site-directed mutagenesis may 
be used to insert new restriction sites, alter glycosylation patterns, change codon 
preference, produce splice variants, introduce mutations, and so forth. 

In another embodiment of the invention, natural, modified, or recombinant nucleic 
acid sequences encoding TRANP may be ligated to a heterologous sequence to encode a 

10 fusion protein. For example, to screen peptide libraries for inhibitors of TRANP activity, 
it may be useful to encode a chimeric TRANP protein that can be recognized by a 
commercially available antibody. A fusion protein may also be engineered to contain a 
cleavage site located between the TRANP encoding sequence and the heterologous protein 
sequence, so that TRANP may be cleaved and purified away from the heterologous 

15 moiety. 

In another embodiment, sequences encoding TRANP may be synthesized, i;i whole 
or in part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. 
(1980) Nucl. Acids Res. Symp. Ser. 215-223, and Horn, T. et al. (1980) Nucl. Acids Res. 
Symp. Ser. 225-232.) Alternatively, the protein itself may be produced using chemical 

20 methods to synthesize the amino acid sequence of TRANP, or a fragment thereof. For 
example, peptide synthesis can be performed using various solid-phase techniques. (See, 
e.g., Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be 
achieved using the ABI 431 A Peptide Synthesizer (Perkin Elmer). 

The newly synthesized peptide may be substantially purified by preparative high 

25 performance liquid chromatography. (See, e.g, Chiez, R.M. and F.Z. Regnier (1990) 
Methods Enzymol. 182:392-421.) The composition of the synthetic peptides may be 
confirmed by amino acid analysis or by sequencing. (See, e.g., Creighton, T. (1983) 
Proteins. Str uctures and Molecular Properties. WH Freeman and Co., New York, NY.) 
Additionally, the amino acid sequence of TRANP, or any part thereof, may be altered 

30 during direct synthesis and/or combined with sequences from other proteins, or any part 
thereof, to produce a variant polypeptide. 

In order to express a biologically active TRANP, the nucleotide sequences 
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encoding TRANP or derivatives thereof may be inserted into appropriate expression 
vector, i.e., a vector which contains the necessary elements for the transcription and 
translation of the inserted coding sequence. 

Methods which are well known to those skilled in the art may be used to construct 

5 expression vectors containing sequences encoding TRANP and appropriate transcriptional 
and translational control elements. These methods include in vitro recombinant DNA 
techniques, synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, 
J. et al. (1989) Molecular Cloning. A Laboratory Manual . Cold Spring Harbor Press, 
Plainview, NY, ch. 4, 8, and 16-17; and Ausubel, F.M. et al. (1995, and periodic 

10 supplements) Current Protocols in Molecular Biology . John Wiley & Sons, New York, 
NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding TRANP. These include, but are not limited to, microorganisms such 
as bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA 

15 expression vectors; yeast transformed with yeast expression vectors; insect cell systems 
infected with virus expression vectors (e.g., biiculovirus); plant cell systems transformed 
with virus expression vectors (e.g., cauliflower mosaic virus (CaMV) or tobacco mosaic 
virus (TMV)) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. 

20 The invention is not limited by the host cell employed. 

The "control elements" or "regulatory sequences" are those non-translated regions, 
e.g., enhancers, promoters, and 5' and 3* untranslated regions, of the vector and 
polynucleotide sequences encoding TRANP which interact with host cellular proteins to 
carry out transcription and translation. Such elements may vary in their strength and 

25 specificity. Depending on the vector system and host utilized, any number of suitable 
transcription and translation elements, including constitutive and inducible promoters, may 
be used. For example, when cloning in bacterial systems, inducible promoters, e.g., hybrid 
lacZ promoter of the Bluescript® phagemid (Stratagene, La Jolla, CA) or pSportl™ 
plasmid (GlBCO/BRL), may be used. The baculovirus polyhedrin promoter may be used 

30 in insect cells. Promoters or enhancers derived from the genomes of plant cells (e.g., heat 
shock, RUBISCO, and storage protein genes) or from plant viruses (e.g., viral promoters 
or leader sequences) may be cloned into the vector. In mammalian cell systems, 
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promoters from mammalian genes or from mammalian viruses are preferable. If it is 
necessary to generate a cell line that contains multiple copies of the sequence encoding 
TRANP, vectors based on SV40 or EBV may be used with an appropriate selectable 
marker. 

5 In bacterial systems, a number of expression vectors may be selected depending 

upon the use intended for TRANP. For example, when large quantities of TRANP are 
needed for the induction of antibodies, vectors which direct high level expression effusion 
proteins that are readily purified may be used. Such vectors include, but are not limited to, 
multifunctional icoli cloning and expression vectors such as Bluescript® (Stratagene), 
10 in which the sequence encoding TRANP may be ligated into the vector in frame with 
sequences for the amino-terminal Met and the subsequent 7 residues of fi-galactosidase so 
that a hybrid protein is produced. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. 
Biol. Chem. 264:5503-5509.) pGEX vectors (Pharmacia Biotech, Uppsala, Sweden) may 
also be used to express foreign polypeptides as fusion proteins with glutathione 
15 S-transferase (GST). In general, such fusion proteins are soluble and can easily be purified 
from lysel cells by adsorption to glutathione-agarose beads followed by elution in the 
presence of free glutathione. Proteins made in such systems may be designed to include 
heparin, thrombin, or factor XA protease cleavage sites so that the cloned polypeptide of 
interest can be released from the GST moiety at will. 
20 In the yeast Saccharomvces cerevisiae , a number of vectors containing constitutive 

or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, may be used. 
(See, e.g., Ausubel, simja; and Grant et al. (1987) Methods Enzymol. 153:516-544.) 

In cases where plant expression vectors are used, the expression of sequences 
encoding TRANP may be driven by any of a number of promoters. For example, viral 
25 promoters such as the 35S and 19S promoters of CaMV may be used alone or in 

combination with the omega leader sequence from TMV. (Takamatsu, N. (1987) EMBO 
J. 6:307-3 1 1 .) Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 
3:1671-1680; Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) 
30 Results Probl. Cell Differ. 17:85-105.) These constructs can be introduced into plant cells 
by direct DNA transformation or pathogen-mediated transfection. Such techniques are 
described in a number of generally available reviews. (See, e.g., Hobbs, S. or Murry, L.E. 
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in McGraw Hill Yearbook of Science and Terhnnl npy (1 992) McGraw Hill, New York, 
NY; pp. 191-196.) 

An insect system may also be used to express TRANP. For example, in one such 
system, Aptographa California nuclear polyhedrosis virus (AcNPV) is used as a vector to 
5 express foreign genes in Spodoptera frugiperda cells or in Trichoplusia larv^ The 
sequences encoding TRANP may be cloned into a non-essential region of the virus, such 
as the polyhedrin gene, and placed under control of the polyhedrin promoter. Successful 
insertion of sequences encoding TRANP will render the polyhedrin gene inactive and 
produce recombinant virus lacking coat protein. The recombinant viruses may then be 
10 used to infect, for example, S. frugiperda cells or Trichoplusia larvae in which TRANP 
may be expressed. (See, e.g., Engelhard, E.K. et al. (1994) Proc. Nat. Acad. Sci. 
91:3224-3227.) 

In mammalian host cells, a number of viral-based expression systems may be 
utilized. In cases where an adenovirus is used as an expression vector, sequences encoding 

1 5 TRANP may be ligated into an adenovirus transcription/translation complex consisting of 
the late promoter and tripartite leader sequence. Insertion in a non-essential El or E3 
region of the viral genome may be used to obtain a viable virus which is capable of 
expressing TRANP in infected host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 
Natl. Acad. Sci. 8 1 :3655-3659.) In addition, transcription enhancers, such as the Rous 

20 sarcoma virus (RS V) enhancer, may be used to increase expression in mammalian host 
cells. 

Human artificial chromosomes (HACs) may also be employed to deliver larger 
fragments of DNA than can be contained and expressed in a piasmid. HACs of about 6 kb 
to 10 Mb are constructed and delivered via conventional delivery methods (liposomes, 

25 polycationic amino polymers, or vesicles) for therapeutic purposes. 

Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding TRANP. Such signals include the ATG initiation codon and adjacent 
sequences. In cases where sequences encoding TRANP and its initiation codon and 
upstream sequences are inserted into the appropriate expression vector, no additional 

30 transcriptional or translational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control 
signals including the ATG initiation codon should be provided. Furthermore, the initiation 
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codon should be in the correct reading frame to ensure translation of the entire insert. 
Exogenous translational elements and initiation codons may be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of 
enhancers appropriate for the particular cell system used. (See, e.g., Scharf, D. et al. 

5 (1994) Results Probl. Cell Differ. 20:125-162.) 

In addition, a host cell strain may be chosen for its ability to modulate expression 
of the inserted sequences or to process the expressed protein in the desired fashion. Such 
modifications of the polypeptide include, but are not limited to, acetylation, carboxylation, 
glycosylation, phosphorylation, lipidation, and acylation. Post-translational processing 

10 which cleaves a "prepro" form of the protein may also be used to facilitate correct 
insertion, folding, and/or function. Different host cells which have specific cellular 
machinery and characteristic mechanisms for post-translational activities (e.g., CHO, 
HeLa, MDCK, HEK293, and WI38), are available from the American Type Culture 
Collection (ATCC, Bethesda, MD) and may be chosen to ensure the correct modification 

1 5 and processing of the foreign protein. 

For long term, high yield production of recombinant pre^ins, stable expression is 
preferred. For example, cell lines capable of stably expressing TRANP can be 
transformed using expression vectors which may contain viral origins of replication and/or 
endogenous expression elements and a selectable marker gene on the same or on a separate 

20 vector. Following the introduction of the vector, cells may be allowed to grow for about 1 
to 2 days in enriched media before being switched to selective media. The purpose of the 
selectable marker is to confer resistance to selection, and its presence allows growth and 
recovery of cells which successfully express the introduced sequences. Resistant clones of 
stably transformed cells may be proliferated using tissue culture techniques appropriate to 

25 the cell type. 

Any number of selection systems may be used to recover transformed cell lines. 
These include, but are not limited to, the herpes simplex virus thymidine kinase genes and 
adenine phosphoribosyltransferase genes, which can be employed in tk or apr cells, 
respectively. (See, e.g., Wigler, M. et al. (1977) Cell 1 1:223-232; and Lowy, I. et al. 
30 ( 1 980) Cell 22:8 1 7-823) Also, antimetabolite, antibiotic, or herbicide resistance can be 
used as the basis for selection. For example, dhfr confers resistance to methotrexate; npt 
confers resistance to the aminoglycosides neomycin and G-418; and als or pat confer 
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resistance to chlorsuifiiron and phosphinotricin acetyltransferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. 77:3567-3570; Colbere-Garapin, F. et al 
(1981) J. Mol. Biol. 150:1-14; and Murry, su^ra.) Additional selectable genes have been 
described, e.g., trpB, which allows cells to utilize indole in place of tryptophan, or hisD, 
5 which allows cells to utilize histinol in place of histidine. (See, e.g., Hartman, S.C. and 
R.C. Mulligan (1988) Proc. Natl. Acad. Sci. 85:8047-8051.) Recently, the use of visible 
markers has gained popularity with such markers as anthocyanins, B glucuronidase and its 
substrate GUS, luciferase and its substrate luciferin. Green fluorescent proteins (GFP) 
(Clontech, Palo Alto, CA) are also used (See, e.g., Chalfie, M. et al. (1994) Science 
10 263:802-805.) These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific 
vector system. (See, e.g., Rhodes, CA. et al. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. 
1 5 For example, if the sequence encoding TRANP is inserted within a marker gene sequence, 
transformed ceils containing sequences encoding TRANP can be identified by the absence 
of marker gene function. Alternatively, a marker gene can be placed in tandem with a 
sequence encoding TRANP under the control of a single promoter. Expression of the 
marker gene in response to induction or selection usually indicates expression of the 
tandem gene as well. 

Alternatively, host cells which contain the nucleic acid sequence encoding TRANP 
and express TRANP may be identified by a variety of procedures known to those of skill 
in the art. These procedures include, but are not limited to, DNA-DNA or DNA-RNA 
hybridizations and protein bioassay or immunoassay techniques which include membrane, 
solution, or chip based technologies for the detection and/or quantification of nucleic acid 
or protein sequences. 

The presence of polynucleotide sequences encoding TRANP can be detected by 
DNA-DNA or DNA-RNA hybridization or amplification using probes; or fragments or 
fragments of polynucleotides encoding TRANP. Nucleic acid amplification based assays 
involve the use of oligonucleotides or oligomers based on the sequences encoding TRANP 
to detect transformants containing DNA or RNA encoding TRANP. 

A variety of protocols for detecting and measuring the expression of TRANP, 
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using either polyclonal or monoclonal antibodies specific for the protein, are known in the 
art Examples of such techniques include enzyme-linked immunosorbent assays 
(ELISAs), radioimmunoassays (RIAs), and fluorescence activated cell sorting (FACS). A 
two-site, monoclonal-based immunoassay utilizing monoclonal antibodies reactive to two 

5 non-interfering epitopes on TRANP is preferred, but a competitive binding assay may be 
employed. These and other assays are well described in the art. (See, e.g., Hampton, R. et 
al. (1990) Serological Methods, a Laboratory Manual . APS Press, St Paul, MN, Section 
IV; and Maddox, D.E. et al. (1983) J. Exp. Med. 158:1211-1216). 

A wide variety of labels and conjugation techniques are known by those skilled in 

10 the art and may be used in various nucleic acid and amino acid assays. Means for 
producing labeled hybridization or PCR probes for detecting sequences related to 
polynucleotides encoding TRANP include oligolabeling, nick translation, end-labeling, or 
PCR amplification using a labeled nucleotide. Alternatively, the sequences encoding 
TRANP, or any fragments thereof, may be cloned into a vector for the production of an 

15 mRNA probe. Such vectors are known in the art, are commercially available, and may be 
used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using 
a variety of commercially available kits, such as those provided by Pharmacia & Upjohn 
(Kalamazoo, MI), Promega (Madison, WI), and U.S. Biochemical Corp. (Cleveland, OH). 

20 Suitable reporter molecules or labels which may be used for ease of detection include 
radionuclides, enzymes, fluorescent, chemiluminescent, or chromogenic agents, as well as 
substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TRANP may be 
cultured under conditions suitable for the expression and recovery of the protein from cell 

25 culture. The protein produced by a transformed cell may be secreted or contained 

intracellularly depending on the sequence and/or the vector used. As will be understood 
by those of skill in the art, expression vectors containing polynucleotides which encode 
TRANP may be designed to contain signal sequences which direct secretion of TRANP 
through a prokaiyotic or eukaryotic cell membrane. Other constructions may be used to 

30 join sequences encoding TRANP to nucleotide sequences encoding a polypeptide domain 
which will facilitate purification of soluble proteins. Such purification facilitating 
domains include, but are not limited to, metal chelating peptides such as 
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histidine-tryptophan modules that allow purification on immobilized metals, protein A 
domains that allow purification on immobilized immunoglobulin, and the domain utilized 
in the FLAGS extension/affinity purification system (Immunex Corp., Seatde, WA). The 
inclusion of cleavable linker sequences, such as those specific for Factor XA or 

5 enterokinase (Invitrogen, San Diego, CA), between the purification domain and the 
TRANP encoding sequence may be used to facilitate purification. One such expression 
vector provides for expression of a fusion protein containing TRANP and a nucleic acid 
encoding 6 histidine residues preceding a thioredoxin or an enterokinase cleavage site. 
The histidine residues facilitate purification on immobilized metal ion affinity 

10 chromatography. (IMAC) (See, e.g., Porath, J. et al. (1992) Prot. Exp. Purif. 3: 263-281.) 
The enterokinase cleavage site provides a means for purifying TRANP from the fusion 
protein. (See, e.g., Kroll, D.J. et al. (1993) DNA Cell Biol. 12:441-453.) 

Fragments of TRANP may be produced not only by recombinant production, but 
also by direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton, T.E. 

15 (1984) Protein: Structures and Molecular Properties, pp. 55-60, W.H. Freeman and Co., 
New York, NY.) Protein synthesis may be performed by manual techniques or by 
automation. Automated synthesis may be achieved, for example, using the Applied 
Biosystems 431 A Peptide Synthesizer (Perkin Elmer). Various fragments of TRANP may 
be synthesized separately and then combined to produce the full length molecule. 

20 

THERAPEUTICS 

Chemical and structural homology exists among the human transport-associated 
proteins of the invention. In addition, TRANP is expressed in cancer. Therefore, TRANP 
appears to play a role in cancer and transport disorders. In cancer and transport disorders 

25 where expression of TRANP is increased, or where expression of TRANP is promoting 
cancer and transport disorders, it is desirable to decrease the expression of TRANP. In 
transport disorders where expression of TRANP is decreased, it is desirable to provide the 
protein or increase the expression of TRANP. 

Therefore, in one embodiment, TRANP or a fragment or derivative thereof may be 

30 administered to a subject to treat or prevent a transport disorder associated with decreased 
expression or activity of TRANP. Such transport disorders include, but are not limited to, 
adrenoleukodystrophy, cystic fibrosis, cystinuria, cystine nephrolithiasis, Dubin- Johnson 
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syndrome, glucose-galactose malabsorption syndrome, diabetes mellitus, diabetes 
insipidus, diastrophic dysplasia, Dubin- Johnson syndrome, Fanconi disease, Fanconi- 
Bickel syndrome, Hartup disease, hyperbilirubinemia, hypercholesterolemia, 
hyperinsulinemia, hyper- and hypoglycemia, iminoglycinuria, Grave's disease, goiter, 

5 Cushing's disease, and Addison's disease, von Gierk disease, Zellweger syndrome; 
gastrointestinal disorders including ulcerative colitis, gastric and duodenal ulcers; other 
conditions associated with abnormal vesicle trafficking including AIDS; allergies 
including hay fever, asthma, and urticaria (hives); autoimmune hemolytic anemia, 
proliferative glomerulonephritis; inflammatory bowel disease, multidrug resistance, 

10 multiple sclerosis; myasthenia gravis, myocardial ischemia, rheumatoid and osteoarthritis, 
Parkinson's disease, Pendred syndrome, scleroderma, Bartter's and Gitelman's syndromes, 
Chediak-Higashi and Sjogren's syndromes, Stargardt's disease, systemic lupus 
erythematosus, and Wilson's disease. 

In another embodiment, a vector capable of expressing TRANP or a fragment or 

15 derivative thereof may be administered to a subject to treat or prevent a transport disorder 
including, but not limited to, those listed above. 

In a further embodiment, a pharmaceutical composition comprising a substantially 
purified TRANP in conjunction with a suitable pharmaceutical carrier may be 
administered to a subject to treat or prevent a transport disorder including, but not limited 

20 to, those provided above. 

In still another embodiment, an agonist which modulates the activity of TRANP 
may be administered to a subject to treat or prevent a transport disorder associated with 
increased expression or activity of TRANP. 

In one embodiment, an antagonist which modulates the activity of TRANP may be 

25 administered to a subject to treat or prevent a cancer associated with increased expression 
or activity of TRANP. Such cancers can include, but are not limited to, adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, 
cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 

30 parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and 
uterus. In one aspect, an antibody which specifically binds TRANP may be used directly 
as an antagonist or indirectly as a targeting or delivery mechanism for bringing a 
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pharmaceutical agent to cells or tissue which express TRANP. 

In an additional embodiment, a vector expressing the complement of the 
polynucleotide encoding TRANP may be administered to a subject to treat or prevent a 
cancer including, but not limited to, those listed above. 

5 In a further embodiment, an antagonist which modulates the activity of TRANP 

may be administered to a subject to treat or prevent a transport disorder associated with 
increased expression or activity of TRANP. Such transport disorders include, but are not 
limited to, those listed above. In one aspect, an antibody which specifically binds TRANP 
may be used directly as an antagonist or indirectly as a targeting or delivery mechanism 

10 for bringing a pharmaceutical agent to cells or tissue which express TRANP. 

In an additional embodiment, a vector expressing the complement of the 
polynucleotide encoding TRANP may be administered to a subject to treat or prevent a 
transport disorder including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, 

!5 complementary sequences, or vectors of the invention may be administered in combination 
with other appropriate therapeutic agents. Selection of the appropriate agents for use in 
combination therapy may be made by one of ordinary skill in the art, according to 
conventional pharmaceutical principles. The combination of therapeutic agents may act 
synergistically to effect the treatment or prevention of the various disorders described 

20 above. Using this approach, one may be able to achieve therapeutic efficacy with lower 
dosages of each agent, thus reducing the potential for adverse side effects. 

An antagonist of TRANP may be produced using methods which are generally 
known in the art. In particular, purified TRANP may be used to produce antibodies or to 
screen libraries of pharmaceutical agents to identify those which specifically bind TRANP. 

25 Antibodies to TRANP may also be generated using methods that are well known in the art. 
Such antibodies may include, but are not limited to, polyclonal, monoclonal, chimeric, and 
single chain antibodies, Fab fragments, and fragments produced by a Fab expression 
library. Neutralizing antibodies (i.e., those which inhibit dimer formation) are especially 
preferred for therapeutic use. 

30 For the production of antibodies, various hosts including goats, rabbits, rats, mice, 

humans, and others may be immunized by injection with TRANP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, 
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various adjuvants may be used to increase immunological response. Such adjuvants 
include, but are not limited to, Freund's, mineral gels such as aluminum hydroxide, and 
surface active substances such as lysolecithin, pluronic polyols, polyanions, peptides, oil 
emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG (bacilli 

5 Calmette-Guerin) and r.n rvnebacteri um parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce 
antibodies to TRANP have an amino acid sequence consisting of at least about 5 amino 
acids, and, more preferably, of at least about 10 amino acids. It is also preferable that 
these oligopeptides, peptides, or fragments are identical to a portion of the amino acid 

10 sequence of the natural protein and contain the entire amino acid sequence of a small, 
naturally occurring molecule. Short stretches of TRANP amino acids may be fused with 
those of another protein, such as KLH, and antibodies to the chimeric molecule may be 
produced. 

Monoclonal antibodies to TRANP may be prepared using any technique which 

1 5 provides for the production of antibody molecules by continuous cell lines in culture. 
These include, but are not limited to, the hybridoma technique, the human B-cell 
hybridoma technique, and the EBV-hybridoma technique. (See, e.g., Kohler, G. et al. 
(1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. Immunol. Methods 81:31-42; Cote, 
RJ. et al. (1983) Proc. Natl. Acad. Sci. 80:2026-2030; and Cole, S.P. et al. (1984) Mol. 

20 Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies " such 
as the splicing of mouse antibody genes to human antibody genes to obtain a molecule 
with appropriate antigen specificity and biological activity, can be used. (See, e.g., 
Morrison, S.L. et al. (1984) Proc. Natl. Acad. Sci. 81:6851-6855; Neuberger, M.S. et al. 

25 (1984) Nature 312:604-608; and Takeda, S. et al. (1985) Nature 314:452-454.) 

Alternatively, techniques described for the production of single chain antibodies may be 
adapted, using methods known in the art, to produce TRANP-specific single chain 
antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may 
be generated by chain shuffling from random combinatorial immunoglobulin libraries. 

30 (See, e.g., Burton D.R. (1991) Proc. Natl. Acad. Sci. 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the 
lymphocyte population or by screening immunoglobulin libraries or panels of highly 
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specific binding reagents as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) 
Proc. Natl. Acad. Sci. 86: 3833-3837; and Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TRANP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 
5 produced by pepsin digestion of the antibody molecule and Fab fragments generated by 
reducing the disulfide bridges of the F(ab r )2 fragments. Alternatively, Fab expression 
libraries may be constructed to allow rapid and easy identification of monoclonal Fab 
fragments with the desired specificity. (See, e.g., Huse, W.D. et al. (1989) Science 
246:1275-1281.) 

1 0 Various immunoassays may be used for screening to identify antibodies having the 

desired specificity. Numerous protocols for competitive binding or immunoradiometric 
assays using either polyclonal or monoclonal antibodies with established specificities are 
well known in the art. Such immunoassays typically involve the measurement of complex 
formation between TRANP and its specific antibody. A two-site, monoclonal-based 

15 immunoassay utilizing monoclonal antibodies reactive to two non-interfering TRANP 
epitopes is preferred, but a competitive binding assay may also be employed. (Maddox, 
supra . ) 

In another embodiment of the invention, the polynucleotides encoding TRANP. or 
any fragment or complement thereof, may be used for therapeutic purposes. In one aspect, 

20 the complement of the polynucleotide encoding TRANP may be used in situations in 
which it would be desirable to block the transcription of the mRNA. In particular, cells 
may be transformed with sequences complementary to polynucleotides encoding TRANP. 
Thus, complementary molecules or fragments may be used to modulate TRANP activity, 
or to achieve regulation of gene function. Such technology is now well known in the art, 

25 and sense or antisense oligonucleotides or larger fragments can be designed from various 
locations along the coding or control regions of sequences encoding TRANP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia 
viruses, or from various bacterial plasmids, may be used for delivery of nucleotide 
sequences to the targeted organ, tissue, or cell population. Methods which are well known 

30 to those skilled in the art can be used to construct vectors which will express nucleic acid 
sequences complementary to the polynucleotides of the gene encoding TRANP. (See, e.g., 
Sambrook, supra : and Ausubel, supra .;) 
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Genes encoding TRANP can be turned off by transforming a cell or tissue with 
expression vectors which express high levels of a polynucleotide, or fragment thereof, 
encoding TRANP. Such constructs may be used to introduce untranslatable sense or 
antisense sequences into a cell. Even in the absence of integration into the DNA, such 
5 vectors may continue to transcribe RNA molecules until they are disabled by endogenous 
nucleases. Transient expression may last for a month or more with a non-replicating 
vector, and may last even longer if appropriate replication elements are part of the vector 
system. 

As mentioned above, modifications of gene expression can be obtained by 

10 designing complementary sequences or antisense molecules (DNA, RNA, or PNA) to the 
control, 5\ or regulatory regions of the gene encoding TRANP. Oligonucleotides derived 
from the transcription initiation site, e.g., between about positions -10 and +10 from the 
start site, are preferred. Similarly, inhibition can be achieved using triple helix 
base-pairing methodology. Triple helix pairing is useful because it causes inhibition of the 

15 ability of the double helix to open sufficiently for the binding of polymerases, transcription 
factors, or regulatory molecuhs. Recent therapeutic advances using triplex DNA have 
been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.l. 
Carr, Molecular and Immunologic Approaches . Futura Publishing Co., Mt. Kisco, NY, pp. 
163-177.) A complementary sequence or antisense molecule may also be designed to 

20 block translation of mRNA by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific 
cleavage of RNA. The mechanism of ribozyme action involves sequence-specific 
hybridization of the ribozyme molecule to complementary target RNA, followed by 
endonucleolytic cleavage. For example, engineered hammerhead motif ribozyme 

25 molecules may specifically and efficiently catalyze endonucleolytic cleavage of sequences 
encoding TRANP. 

Specific ribozyme cleavage sites within any potential RNA target are initially 
identified by scanning the target molecule for ribozyme cleavage sites, including the 
following sequences: GUA, GUU, and GUC. Once identified, short RNA sequences of 
30 between 1 5 and 20 ribonucleotides, corresponding to the region of the target gene 

containing the cleavage site, may be evaluated for secondary structural features which may 
render the oligonucleotide inoperable. The suitability of candidate targets may also be 
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evaluated by testing accessibility to hybridization with complementary oligonucleotides 
using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of nucleic acid molecules. 

5 These include techniques for chemically synthesizing oligonucleotides such as solid phase 
phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by 
in vitro and in vivo transcription of DNA sequences encoding TRANP. Such DN A 
sequences may be incorporated into a wide variety of vectors with suitable RNA 
polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs that 

10 synthesize complementary RNA, constitutively or inducibly, can be introduced into cell 
lines, cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. 
Possible modifications include, but are not limited to, the addition of flanking sequences at 
the 5' and/or 3' ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather 

15 than phosphodiesterase linkages within the backbone of the molecule. This concept is 
inherent in the production of PNAs and can be extended in all of these molecules by the 
inclusion of nontraditional bases such as inosine, queosine, and wybutosine, as well as 
acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, guanine, 
thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

20 Many methods for introducing vectors into cells or tissues are available and 

equally suitable for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be 
introduced into stem cells taken from the patient and clonally propagated for autologous 
transplant back into that same patient. Delivery by transfection, by liposome injections, or 
by polycationic amino polymers may be achieved using methods which are well known in 

25 the art. (See, e.g., Goldman, C.K. et al. (1997) Nature Biotechnology 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in 
need of such therapy, including, for example, mammals such as dogs, cats, cows, horses, 
rabbits, monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a 

30 pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable 
carrier, for any of the therapeutic effects discussed above. Such pharmaceutical 
compositions may consist of TRANP, antibodies to TRANP, and mimetics, agonists, 
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antagonists, or inhibitors of TRANP. The compositions may be administered alone or in 
combination with at least one other agent, such as a stabilizing compound, which may be 
administered in any sterile, biocompatible pharmaceutical carrier including, but not limited 
to, saline, buffered saline, dextrose, and water. The compositions may be administered to a 
patient alone, or in combination with other agents, drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by 
any number of routes including, but not limited to, oral, intravenous, intramuscular, 
intra-arterial, intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, 
intraperitoneal, intranasal, enteral, topical, sublingual, or rectal means. 

In addition to the active ingredients, these pharmaceutical compositions may 
contain suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries 
which facilitate processing of the active compounds into preparations which can be used 
pharmaceutically. Further details on techniques for formulation and administration may 
be found in the latest edition of Remington's Pharmace utical Sciences (Maack Publishing 
Co., Easton, PA). 

Pharmaceutical compositions for oral administration can be formulated using 
pharmaceutically acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as 
tablets, pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for 
ingestion by the patient. 

Pharmaceutical preparations for oral use can be obtained through combining active 
compounds with solid excipient and processing the resultant mixture of granules 
(optionally, after grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be 
added, if desired. Suitable excipients include carbohydrate or protein fillers, such as 
sugars, including lactose, sucrose, mannitol, and sorbitol; starch from corn, wheat, rice, 
potato, or other plants; cellulose, such as methyl cellulose, 

hydroxypropylmethyl-cellulose, or sodium carboxymethylcellulose; gums, including 
arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, disintegrating 
or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, 
and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as 
concentrated sugar solutions, which may also contain gum arabic, talc, 
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polyvinylpyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer 
solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may 
be added to the tablets or dragee coatings for product identification or to characterize the 
quantity of active compound, i.e., dosage. 

5 Pharmaceutical preparations which can be used orally include push-fit capsules 

made of gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as 
glycerol or sorbitol. Push-fit capsules can contain active ingredients mixed with fillers or 
binders, such as lactose or starches, lubricants, such as talc or magnesium stearate, and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or 

10 suspended in suitable liquids, such as fatty oils, liquid, or liquid polyethylene glycol with 
or without stabilizers. 

Pharmaceutical formulations suitable for parenteral administration may be 
formulated in aqueous solutions, preferably in physiologically compatible buffers such as 
Hanks's solution, Ringer's solution, or physiologically buffered saline. Aqueous injection 

15 suspensions may contain substances which increase the viscosity of the suspension, such 
as sodium carboxy methyl cellulose, sorbitol, or dextran. Additionally, suspensions of the 
active compounds may be prepared as appropriate oily injection suspensions. Suitable 
lipophilic solvents or vehicles include fatty oils, such as sesame oil, or synthetic fatty acid 
esters, such as ethyl oleate, triglycerides, or liposomes. Non-lipid polycationic amino 

20 polymers may also be used for delivery. Optionally, the suspension may also contain 
suitable stabilizers or agents to increase the solubility of the compounds and allow for the 
preparation of highly concentrated solutions. 

For topical or nasal administration, penetrants appropriate to the particular barrier 
to be permeated are used in the formulation. Such penetrants are generally known in the 

25 art. 

The pharmaceutical compositions of the present invention may be manufactured in 
a manner that is known in the art, e.g., by means of conventional mixing, dissolving, 
granulating, dragee-making, levigating, emulsifying, encapsulating, entrapping, or 
lyophilizing processes. 

30 The pharmaceutical composition may be provided as a salt and can be formed with 

many acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, 
malic, and succinic acid. Salts tend to be more soluble in aqueous or other protonic 
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solvents than are the corresponding free base forms. In other cases, the preferred 
preparation may be a lyophilized powder which may contain any or all of the following: 1 
mM to 50 mM histidine, 0.1% to 2% sucrose, and 2% to 7% mannitol, at a pH range of 4.5 
to 5.5, that is combined with buffer prior to use. 
5 After pharmaceutical compositions have been prepared, they can be placed in an 

appropriate container and labeled for treatment of an indicated condition. For 
administration of TRANP, such labeling would include amount, frequency, and method of 
administration. 

Pharmaceutical compositions suitable for use in the invention include compositions 
10 wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those 
skilled in the art. 

For any compound, the therapeutically effective dose can be estimated initially 
either in cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, 

15 rabbits, dogs, or pigs. An animal model may also be used to determine the appropriate 
concentration range and route of administration. Such information can then be used to 
determine useful doses and routes for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for 
example TRANP or fragments thereof, antibodies of TRANP, and agonists, antagonists or 

20 inhibitors of TRANP, which ameliorates the symptoms or condition. Therapeutic efficacy 
and toxicity may be determined by standard pharmaceutical procedures in cell cultures or 
with experimental animals, such as by calculating the ED50 (the dose therapeutically 
effective in 50% of the population) or LD50 (the dose lethal to 50% of the population) 
statistics. The dose ratio of therapeutic to toxic effects is the therapeutic index, and it can 

25 be expressed as the ED50/LD50 ratio. Pharmaceutical compositions which exhibit large 
therapeutic indices are preferred. The data obtained from cell culture assays and animal 
studies are used to formulate a range of dosage for human use. The dosage contained in 
such compositions is preferably within a range of circulating concentrations that includes 
the ED50 with little or no toxicity. The dosage varies within this range depending upon 

30 the dosage form employed, the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related 
to the subject requiring treatment. Dosage and administration are adjusted to provide 
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sufficient levels of the active moiety or to maintain the desired effect. Factors which may 
be taken into account include the severity of the disease state, the general health of the 
subject, the age, weight, and gender of the subject, time and frequency of administration, 
drug combination(s), reaction sensitivities, and response to therapy. Long-acting 
5 pharmaceutical compositions may be administered every 3 to 4 days, every week, or 
biweekly depending on the half-life and clearance rate of the particular formulation. 

Normal dosage amounts may vary from about 0.1 jug to 100,000 //g, up to a total 
dose of about 1 gram, depending upon the route of administration. Guidance as to 
particular dosages and methods of delivery is provided in the literature and generally 
10 available to practitioners in the art. Those skilled in the art will employ different 

formulations for nucleotides than for proteins or their inhibitors. Similarly, delivery of 
polynucleotides or polypeptides will be specific to particular cells, conditions, locations, 
etc. 

DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TRANP may be used 
for the diagnosis of disorders characterized by expression of TRANP, or in assays to 
monitor patients being treated with TRANP or agonists, antagonists, or inhibitors of 
TRANP. Antibodies useful for diagnostic purposes may be prepared in the same manner 
as described above for therapeutics. Diagnostic assays for TRANP include methods which 
utilize the antibody and a label to detect TRANP in human body fluids or in extracts of 
cells or tissues. The antibodies may be used with or without modification, and may be 
labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of 
reporter molecules, several of which are described above, are known in the art and may be 
used. 

A variety of protocols for measuring TRANP, including ELISAs, RIAs, and 
FACS, are known in the art and provide a basis for diagnosing altered or abnormal levels 
of TRANP expression. Normal or standard values for TRANP expression are established 
by combining body fluids or cell extracts taken from normal mammalian subjects, 
preferably human, with antibody to TRANP under conditions suitable for complex 
formation The amount of standard complex formation may be quantitated by various 
methods, preferably by photometric means. Quantities of TRANP expressed in subject, 
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control, and disease samples from biopsied tissues are compared with the standard values. 
Deviation between standard and subject values establishes the parameters for diagnosing 
disease. 

In another embodiment of the invention, the polynucleotides encoding TRANP 
5 may be used for diagnostic purposes. The polynucleotides which may be used include 
oligonucleotide sequences, complementary RNA and DNA molecules, and PNAs. The 
polynucleotides may be used to detect and quantitate gene expression in biopsied tissues in 
which expression of TRANP may be correlated with disease. The diagnostic assay may be 
used to determine absence, presence, and excess expression of TRANP, and to monitor 

10 regulation of TRANP levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding TRANP or closely 
related molecules may be used to identify nucleic acid sequences which encode TRANP. 
The specificity of the probe, whether it is made from a highly specific region, e.g., the 5 f 

15 regulatory region, or from a less specific region, e.g., a conserved motif, and the 

stringency of the hybridization or amplification (maximal, high, intermediate, or low), will 
determine whether the probe identifies only naturally occurring sequences encoding 
TRANP, alleles, or related sequences. 

Probes may also be used for the detection of related sequences, and should 

20 preferably contain at least 50% of the nucleotides from any of the TRANP encoding 
sequences. The hybridization probes of the subject invention may be DNA or RNA and 
may be derived from the sequence of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, 
SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, or SEQ 
ID NO: 10, or from genomic sequences including promoters, enhancers, and introns of the 

25 TRANP gene. 

Means for producing specific hybridization probes for DNAs encoding TRANP 
include the cloning of polynucleotide sequences encoding TRANP or TRANP derivatives 
into vectors for the production of mRNA probes. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by means of 
30 the addition of the appropriate RNA polymerases and the appropriate labeled nucleotides. 
Hybridization probes may be labeled by a variety of reporter groups, for example, by 
radionuclides such as 32 P or 35 S, or by enzymatic labels, such as alkaline phosphatase 
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coupled to the probe via avidin/biotin coupling systems, and the like. 

Polynucleotide sequences encoding TRANP may be used for the diagnosis of a 
disorder associated with expression of TRANP. Examples of such a disorder include, but 
are not limited to, transport disorders such as adrenoleukodystrophy, cystic fibrosis, 

5 cystinuria, cystine nephrolithiasis, Dubin- Johnson syndrome, glucose-galactose 
malabsorption syndrome, diabetes mellitus, diabetes insipidus, diastrophic dysplasia, 
Dubin- Johnson syndrome, Fanconi disease, Fanconi-Bickel syndrome, Hartup disease, 
hyperbilirubinemia, hypercholesterolemia, hyperinsulinemia, hyper- and hypoglycemia, 
iminoglycinuria, Grave's disease, goiter, Cushing's disease, and Addison's disease, von 

10 Gierk disease, Zellweger syndrome; gastrointestinal disorders including ulcerative colitis, 
gastric and duodenal ulcers; other conditions associated with abnormal vesicle trafficking 
including AIDS; allergies including hay fever, asthma, and urticaria (hives); autoimmune 
hemolytic anemia, proliferative glomerulonephritis; inflammatory bowel disease, 
multidrug resistance, multiple sclerosis; myasthenia gravis, myocardial ischemia, 

15 rheumatoid and osteoarthritis, Parkinson's disease, Pendred syndrome, scleroderma, 
Bartter's and Gitelmaris syndromes, Chediak-Higashi and Sjogren's syndromes, 
Stargardt's disease, systemic lupus erythematosus, and Wilson's disease; and cancer such 
as adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, 
teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, bone 

20 marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, 
liver, lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, 
spleen, testis, thymus, thyroid, and uterus. The polynucleotide sequences encoding 
TRANP may be used in Southern or northern analysis, dot blot, or other membrane-based 
technologies; in PCR technologies; in dipstick, pin, and ELISA assays; and in microarrays 

25 utilizing fluids or tissues from patients to detect altered TRANP expression. Such 
qualitative or quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding TRANP may be useful in 
assays that detect the presence of associated disorders, particularly those mentioned above. 
The nucleotide sequences encoding TRANP may be labeled by standard methods and 

30 added to a fluid or tissue sample from a patient under conditions suitable for the formation 
of hybridization complexes. After a suitable incubation period, the sample is washed and 
the signal is quantitated and compared with a standard value. If the amount of signal in 
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the patient sample is significantly altered in comparison to a control sample then the 
presence of altered levels of nucleotide sequences encoding TRANP in the sample 
indicates the presence of the associated disorder. Such assays may also be used to 
evaluate the efficacy of a particular therapeutic treatment regimen in animal studies, in 
5 clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with 
expression of TRANP, a normal or standard profile for expression is established. This 
may be accomplished by combining body fluids or cell extracts taken from normal 
subjects, either animal or human, with a sequence, or a fragment thereof, encoding 

10 TRANP, under conditions suitable for hybridization or amplification. Standard 

hybridization may be quantified by comparing the values obtained from normal subjects 
with values from an experiment in which a known amount of a substantially purified 
polynucleotide is used. Standard values obtained in this manner may be compared with 
values obtained from samples from patients who are symptomatic for a disorder. 

1 5 Deviation from standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of 
expression in the patient begins to approximate that which is observed in the normal 
subject. The results obtained from successive assays may be used to show the efficacy of 

20 treatment over a period ranging from several days to months. 

With respect to cancer, the presence of a relatively high amount of transcript in 
biopsied tissue from an individual may indicate a predisposition for the development of 
the disease, or may provide a means for detecting the disease prior to the appearance of 
actual clinical symptoms. A more definitive diagnosis of this type may allow health 

25 professionals to employ preventative measures or aggressive treatment earlier thereby 
preventing the development or further progression of the cancer. ■ 

Additional diagnostic uses for oligonucleotides designed from the sequences 
encoding TRANP may involve the use of PCR. These oligomers may be chemically 
synthesized, generated enzymatically, or produced in vitro . Oligomers will preferably 

30 contain a fragment of a polynucleotide encoding TRANP, or a fragment of a 

polynucleotide complementary to the polynucleotide encoding TRANP, and will be 
employed under optimized conditions for identification of a specific gene or condition. 



WO 99/41373 



PCT/US99/02527 



Oligomers may also be employed under less stringent conditions for detection or 
quantitation of closely related DNA or RNA sequences. 

Methods which may also be used to quantitate the expression of TRANP include 
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 

5 interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. 

Immunol. Methods 159:235-244; and Duplaa, C. et al. (1993) Anal. Biochem. 229-236.) 
The speed of quantitation of multiple samples may be accelerated by running the assay in 
an ELISA format where the oligomer of interest is presented in various dilutions and a 
spectrophotometric or colorimetric response gives rapid quantitation. 

10 In further embodiments, oligonucleotides or longer fragments derived from any of 

the polynucleotide sequences described herein may be used as targets in a microarray. The 
microarray can be used to monitor the expression level of large numbers of genes 
simultaneously and to identify genetic variants, mutations, and polymorphisms. This 
information may be used to determine gene function, to understand the genetic basis of a 

15 disorder, to diagnose a disorder, and to develop and monitor the activities of therapeutic 
agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. 
(See, e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) 
Proc. Natl. Acad. Sci. 93:10614-10619; Baldeschweiler et al. (1995) PCT application 
20 W095/251 116; Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. 
(1997) Proc. Natl. Acad. Sci. 94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 
5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding TRANP 
may be used to generate hybridization probes useful in mapping the naturally occurring 

25 genomic sequence. The sequences may be mapped to a particular chromosome, to a 
specific region of a chromosome, or to artificial chromosome constructions, e.g., human 
artificial chromosomes (HACs), yeast artificial chromosomes (YACs), bacterial artificial 
chromosomes (BACs), bacterial PI constructions, or single chromosome cDNA libraries. 
(See, e.g., Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet 

30 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. 
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(1995) in Meyers, R.A. (ed.) Molecular Biolpgy aqd Biotechnology, VCH Publishers New 
York, NY, pp. 965-968.) Examples of genetic map data can be found in various scientific 
journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between 
the location of the gene encoding TRANP on a physical chromosomal map and a specific 
5 disorder, or a predisposition to a specific disorder, may help define the region of DNA 
associated with that disorder. The nucleotide sequences of the invention may be used to 
detect differences in gene sequences among normal, carrier, and affected individuals. 

In situ hybridization of chromosomal preparations and physical mapping 
techniques, such as linkage analysis using established chromosomal markers, may be used 
for extending genetic maps. Often the placement of a gene on the chromosome of another 
mammalian species, such as mouse, may reveal associated markers even if the number or 
arm of a particular human chromosome is not known. New sequences can be assigned to 
chromosomal arms by physical mapping. This provides valuable information to 
investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the disease or syndrome has been crudely localized by genetic linkage to 
a particular genomic region, e.g., AT to 1 lq22-23, ai;y sequences mapping to that area 
may represent associated or regulatory genes for further investigation. (See, e.g., Gatti, 
R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of the subject invention 
may also be used to detect differences in the chromosomal location due to translocation, 
inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, TRANP, its catalytic or immunogenic 
fragments, or oligopeptides thereof can be used for screening libraries of compounds in 
any of a variety of drug screening techniques. The fragment employed in such screening 
may be free in solution, affixed to a solid support, borne on a cell surface, or located 
intracellularly. The formation of binding complexes between TRANP and the agent being 
tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest (See, e.g., Geysen, 
et al. (1984) PCT application WO84/03564.) In this method, large numbers of different 
small test compounds are synthesized on a solid substrate, such as plastic pins or some 
other surface. The test compounds are reacted with TRANP, or fragments thereof, and 
washed. Bound TRANP is then detected by methods well known in the art. Purified 
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TRANP can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which 
5 neutralizing antibodies capable of binding TRANP specifically compete with a test 
compound for binding TRANP. In this manner, antibodies can be used to detect the 
presence of any peptide which shares one or more antigenic determinants with TRANP. 

In additional embodiments, the nucleotide sequences which encode TRANP may 
be used in any molecular biology techniques that have yet to be developed, provided the 
10 new techniques rely on properties of nucleotide sequences that are currently known, 
including, but not limited to, such properties as the triplet genetic code and specific base 
pair interactions. 

The examples below are provided to illustrate the subject invention and are not 
included for the purpose of limiting the invention. 

15 

EXAMPLES 

For purposes of example, the preparation and sequencing of the PROSTUT10 
cDNA library, from which Incyte Clone 1689731 was isolated, is described below. 
Preparation and sequencing of cDNAs in libraries in the LIFESEQ™ database have varied 
20 over time, and the gradual changes involved use of kits, plasmids, and machinery available 
at the time the library was made and analyzed. 

I. PROSTUT10 cDNA Library Construction 

The PROSTUT10 cDNA library was constructed from prostate tumor tissue 

25 obtained from a 66-year-old Caucasian male. The tissue was excised when the patient 
underwent a radical prostatectomy and regional lymph node excision. The pathology 
report indicated a prostate tumor Gleason grade (2+3) involving the left and right side 
centrally. The tumor was confined and did not involve the capsule. Perineural invasion 
was absent. Initially, the patient presented with elevated prostate specific antigen (PSA). 

30 The patient history included benign hypertension and alcohol use. The patient's family 
history included a malignant neoplasm of the prostate and a malignant neoplasm of bone 
and articular cartilage in the patient's father, and benign hypertension the patient's sibling. 
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The frozen tissue was homogenized and lysed using a Brinkmann Homogenizer 
Polytron PT-3000 (Brinkmann Instruments, Westbury, NJ) in guanidinium isothiocyanate 
solution. The lysate was centrifuged over a 5.7 M CsCl cushion using an Beckman SW28 
rotor in a Beckman L8-70M Ultracentrifuge (Beckman Instruments) for 18 hours at 25,000 

5 rpm at ambient temperature. The RNA was extracted with acid phenol pH 4.7, 
precipitated using 0.3 M sodium acetate and 2.5 volumes of ethanol, resuspended in 
RNAse-free water, and DNase treated at 37°C. RNA extraction and precipitation were 
repeated as before. The mRNA was isolated with the Qiagen Oligotex kit (QIAGEN, Inc., 
Chatsworth, CA) and used to construct the cDNA libraries. 

10 The mRNA was handled according to the recommended protocols in the 

Superscript Plasmid System (Cat. #18248-013, Gibco/BRL, Gaithersburg, MD). 
PROSTUT10 cDNAs were fractionated on a Sepharose CL4B column (Cat. #275105-01, 
Pharmacia), and those cDNAs exceeding 400 bp were ligated into pINCY. The plasmid 
pINCY was subsequently transformed into DH5a™ competent cells (Cat. #18258-012, 

15 Gibco/BRL). 

II Isolation and Sequencing of cDNA Clones 

Plasmid DNA was released from the cells and purified using the REAL Prep 96 
Plasmid Kit (Catalog #26173, QIAGEN, Inc.). The recommended protocol was employed 

20 except for the following changes: 1) the bacteria were cultured in 1 ml of sterile Terrific 
Broth (Catalog #2271 1, Gibco/BRL) with carbenicillin at 25 mg/L and glycerol at 0.4%; 
2) after inoculation, the cultures were incubated for 19 hours and at the end of incubation, 
the cells were lysed with 0.3 ml of lysis buffer; and 3) following isopropanol precipitation, 
the plasmid DNA pellet was resuspended in 0.1 ml of distilled water. After the last step in 

25 the protocol, samples were transferred to a 96-well block for storage at 4° C. 

The cDNAs were sequenced by the method of Sanger et al. (1975, J. Mol. Biol. 
94:44 If), using a Hamilton Micro Lab 2200 (Hamilton, Reno, NV) in combination with 
Peltier Thermal Cyclers (PTC200 from MJ Research, Watertown, MA) and Applied 
Biosystems 377 DNA Sequencing Systems; and the reading frame was determined. 

30 

III. Homology Searching of cDN A Clones and Their Deduced Proteins 

The nucleotide sequences and/or amino acid sequences of the Sequence Listing 
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were used to query sequences in the GenBank, SwissProt, BLOCKS, and Pima II 
databases. These databases, which contain previously identified and annotated sequences, 
were searched for regions of homology using BLAST (Basic Local Alignment Search 
Tool). (See, e.g., Altschul, S.F. (1993) J. Mol. Evol 36:290-300; and Altschul et al. (1990) 

5 J. Mol. Biol. 215:403-410.) 

BLAST produced alignments of both nucleotide and amino acid sequences to 
determine sequence similarity. Because of the local nature of the alignments, BLAST was 
especially useful in determining exact matches or in identifying homologs which may be 
of prokaryotic (bacterial) or eukaryotic (animal, fungal, or plant) origin. Other algorithms 

1 0 could have been used when dealing with primary sequence patterns and secondary 
structure gap penalties. (See, e.g., Smith, T. et al. (1992) Protein Engineering 5:35-51.) 
The sequences disclosed in this application have lengths of at least 49 nucleotides and 
have no more than 12% uncalled bases (where N is recorded rather than A, C, G, or T). 
The BLAST approach searched for matches between a query sequence and a 

15 database sequence. BLAST evaluated the statistical significance of any matches found, 
and reported only those matches that satisfy the user-selected threshold o : significance. In 
this application, threshold was set at 10" 25 for nucleotides and 10* 8 for peptides. 

Incyte nucleotide sequences were searched against the GenBank databases for 
primate (pri), rodent (rod), and other mammalian sequences (mam), and deduced amino 

20 acid sequences from the same clones were then searched against GenBank functional 
protein databases, mammalian (mamp), vertebrate (vrtp), and eukaryote (eukp), for 
homology. 

Additionally, sequences identified from cDNA libraries may be analyzed to 
identify those gene sequences encoding conserved protein motifs using an appropriate 

25 analysis program, e.g., the Block 2 Bioanalysis Program (Incyte, Palo Alto, CA). This 
motif analysis program, based on sequence information contained in the Swiss-Prot 
Database and PROSITE, is a method of determining the function of uncharacterized 
proteins translated from genomic or cDNA sequences. (See, e.g., Bairoch, A. et al. (1997) 
Nucleic Acids Res. 25:217-221; and Attwood, T. K. et al. (1997) J. Chem. Inf. Comput 

30 Sci. 37:41 7-424.) PROSITE may be used to identify common functional or structural 
domains in divergent proteins. The method is based on weight matrices. Motifs identified 
by this method are then calibrated against the SWISS-PROT database in order to obtain a 
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measure of the chance distribution of the matches. 

In another alternative, Hidden Markov models (HMMs) may be used to find 
protein domains, each defined by a dataset of proteins known to have a common biological 
function. (See, e.g., Pearson, W.R. and D.J. Lipman (1988) Proc. Natl. Acad. Sci. 

5 85:2444-2448; and Smith, T.F. and M.S. Waterman (1981) J. Mol. Biol. 147:195-197.) 
HMMs were initially developed to examine speech recognition patterns, but are now being 
used in a biological context to analyze protein and nucleic acid sequences as well as to 
model protein structure. (See, e.g., Krogh, A. et al. (1994) J. Mol. Biol. 235:1501-1531; 
and Collin, M. et al. (1993) Protein Sci. 2:305-314.) HMMs have a formal probabilistic 

10 basis and use position-specific scores for amino acids or nucleotides. The algorithm 
continues to incorporate information from newly identified sequences to increase its motif 
analysis capabilities. 

IV. Northern Analysis 

15 Northern analysis is a laboratory technique used to detect the presence of a 

transcript of a gene and involves the hybridization of a labeled nucleotide sequence to a 
membrane on which RNAs from a particular cell type or tissue have been bound. (See, 
e.g., Sambrook, supra , ch. 7; and Ausubel, F.M. et al. supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST are used to search for identical 

20 or related molecules in nucleotide databases such as GenBank or LIFESEQ™ database 
(Incyte Pharmaceuticals). This analysis is much faster than multiple membrane-based 
hybridizations. In addition, the sensitivity of the computer search can be modified to 
determine whether any particular match is categorized as exact or homologous. 
The basis of the search is the product score, which is defined as: 

25 

% sequence ide ntity x % maximum BLAST score 
100 

The product score takes into account both the degree of similarity between two sequences 
and the length of the sequence match. For example, with a product score of 40, the match 
30 will be exact within a 1 % to 2% error, and, with a product score of 70, the match will be 
exact. Homologous molecules are usually identified by selecting those which show 
product scores between 15 and 40, although lower scores may identify related molecules. 
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The results of northern analysis are reported as a list of libraries in which the 
transcript encoding TRANP occurs. Abundance and percent abundance are also reported. 
Abundance directly reflects the number of times a particular transcript is represented in a 
cDNA library, and percent abundance is abundance divided by the total number of 
5 sequences examined in the cDNA library. 

V. Extension of TRANP Encoding Polynucleotides 

The sequence of one of the polynucleotides of the present invention was used to 
design oligonucleotide primers for extending a partial nucleotide sequence to full length. 

10 One primer was synthesized to initiate extension of an antisense polynucleotide, and the 
other was synthesized to initiate extension of a sense polynucleotide. Primers were used 
to facilitate the extension of the known sequence "outward" generating amplicons 
containing new unknown nucleotide sequence for the region of interest. The initial primers 
were designed from the cDNA using OLIGO 4.06 (National Biosciences, Plymouth, MN), 

1 5 or another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC 
content of about 50% or more, and to anneal to the target sequence at temperatures of 
about 68 °C to about 72 °C. Any stretch of nucleotides which would result in hairpin 
structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries (GlBCO/BRL) were used to extend the sequence. 

20 If more than one extension is necessary or desired, additional sets of primers are designed 
to further extend the known region. 

High fidelity amplification was obtained by following the instructions for the XL- 
PCR kit (Perkin Elmer) and thoroughly mixing the enzyme and reaction mix. PGR was 
performed using the Peltier Thermal Cycler (PTC200; M.J. Research, Watertown, MA), 

25 beginning with 40 pmol of each primer and the recommended concentrations of all other 
components of the kit, with the following parameters: • 



Step 1 94° C for 1 min (initial denaturation) 

Step 2 65° C fori min 

Step 3 68° C for 6 min 

30 Step 4 94° C for 15 sec 

Step 5 65° C fori min 

Step 6 68° C for 7 min 

Step 7 Repeat steps 4 through 6 for an additional 1 5 cycles 

Step 8 94° C for 15 sec 
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5 



Step 9 
Step 10 
Step 1 1 
Step 12 
Step 13 



65° C fori min 
68°Cfor7:15min 

Repeat steps 8 through 10 for an additional 12 cycles 
72° C for 8 min 
4° C (and holding) 



A 5 fil to 10 //l aliquot of the reaction mixture was analyzed by electrophoresis on 
a low concentration (about 0.6% to 0.8%) agarose mini-gel to determine which reactions 
were successful in extending the sequence. Bands thought to contain the largest products 

10 were excised from the gel, purified using QIAQuick™ (QIAGEN Inc., Chatsworth, CA), 
and trimmed of overhangs using Klenow enzyme to facilitate religation and cloning. 

After ethanol precipitation, the products were redissolved in 13 ^1 of ligation 
buffer, \u\ T4-DNA ligase (15 units) and 1^1 T4 polynucleotide kinase were added, and 
the mixture was incubated at room temperature for 2 to 3 hours, or overnight at 16° C. 

15 Competent E. coli cells (in 40 ^1 of appropriate media) were transformed with 3 ^1 of 
ligation mixture and cultured in 80 fA of SOC medium. (See, e.g., Sambrook, supra . 
Appendix A, p. 2.) After incubation for one hour at 37° C, the E. coli mixture was plated 
on Luria Bertani (LB) agar (See, e.g., Sambrook, supra . Appendix A, p. 1) containing 2x 
Carb. The following day, several colonies were randomly picked from each plate and 

20 cultured in 1 50 //l of liquid LB/2x Carb medium placed in an individual well of an 

appropriate commercially-available sterile 96-well microtiter plate. The following day, 5 
fxl of each overnight culture was transferred into a non-sterile 96-well plate and, after 
dilution 1:10 with water, 5 fA from each sample was transferred into a PCR array. 



25 4 units of rTth DNA polymerase, a vector primer, and one or both of the gene specific 
primers used for the extension reaction were added to each well. Amplification was 
performed using the following conditions: 



For PCR amplification, 18 of concentrated PCR reaction mix (3.3x) containing 



30 



Step 1 
Step 2 
Step 3 
Step 4 
StepS 
Step 6 
Step 7 



94° C for 60 sec 
94° C for 20 sec 
55° C for 30 sec 
72° C for 90 sec 



Repeat steps 2 through 4 for an additional 29 cycles 



72° C for 180 sec 
4°C(and holding) 



35 



Aliquots of the PCR reactions were run on agarose gels together with molecular 
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weight markers. The sizes of the PCR products were compared to the original partial 
cDNAs, and appropriate clones were selected, ligated into plasmid, and sequenced. 

The nucleotide sequences of SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:13, 
SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, and SEQ ID NO:18 are 
5 used to obtain 5' regulatory sequences using the procedure above, oligonucleotides 
designed for 5' extension, and an appropriate genomic library. 

VI. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
10 NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
or SEQ ID NO: 1 8 are employed to screen cDNAs, genomic DNAs, or mRNAs. Although 
the labeling of oligonucleotides, consisting of about 20 base pairs, is specifically 
described, essentially the same procedure is used with larger nucleotide fragments. 
Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
15 (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 ^Ci of 
[y- 32 P] adenosine triphosphate (Amersham, Chicago, IL), and T4 polynucleotide kinase 
(DuPont NEN®, Boston, MA). The labeled oligonucleotides are substantially purified 
using a Sephadex G-25 superfine resin column (Pharmacia & Upjohn, Kalamazoo, MI). 
An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical 
20 membrane-based hybridization analysis of human genomic DNA digested with one of the 
following endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba 1, or Pvu II (DuPont NEN, 
Boston, MA). 

The DNA from each digest is fractionated on a 0.7 percent agarose gel and 
transferred to nylon membranes (Nytran Plus, Schleicher & Schuell, Durham, NH). 
25 Hybridization is carried out for 1 6 hours at 40°C. To remove nonspecific signals, blots 
are sequentially washed at room temperature under increasingly stringent conditions up to 
0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. After XOMAT AR™ film 
(Kodak, Rochester, NY) is exposed to the blots to film for several hours, hybridization 
patterns are compared visually. 

30 

VII. Microarrays 

A chemical coupling procedure and an ink jet device can be used to synthesize 
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array elements on the surface of a substrate. (See, e.g., Baldeschweiler, supra .) An array 
analogous to a dot or slot blot may also be used to arrange and link elements to the surface 
of a substrate using thermal, UV, chemical, or mechanical bonding procedures. A typical 
array may be produced by hand or using available methods and machines and contain any 

5 appropriate number of elements. After hybridization, nonhybridized probes are removed 
and a scanner used to determine the levels and patterns of fluorescence. The degree of 
complementarity and the relative abundance of each probe which hybridizes to an element 
on the microarray may be assessed through analysis of the scanned images. 

Full-length cDN As, Expressed Sequence Tags (ESTs), or fragments thereof may 

10 comprise the elements of the microarray. Fragments suitable for hybridization can be 
selected using software well known in the art such as L ASERGENE™. Full-length 
cDNAs, ESTs ? or fragments thereof corresponding to one of the nucleotide sequences of 
the present invention, or selected at random from a cDNA library relevant to the present 
invention, are arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed 

15 to the slide using, e.g., UV cross-linking followed by thermal and chemical treatments and 
subsequent drying. (See, e.g., Schena, M. et al. (1995) Science 270:467-470; and Shalon, 
D. et al. (1996) Genome Res. 6:639-645.) Fluorescent probes are prepared and used for 
hybridization to the elements on the substrate. The substrate is analyzed by procedures 
described above. 

20 

VIIL Complementary Polynucleotides 

Sequences complementary to the TRANP-encoding sequences, or any parts 
thereof, are used to detect, decrease, or inhibit expression of naturally occurring TRANP. 
Although use of oligonucleotides comprising from about 15 to 30 base pairs is described, 

25 essentially the same procedure is used with smaller or with larger sequence fragments. 
Appropriate oligonucleotides are designed using Oligo 4.06 software and the coding 
sequence of TRANP. To inhibit transcription, a complementary oligonucleotide is 
designed from the most unique 5 1 sequence and used to prevent promoter binding to the 
coding sequence. To inhibit translation, a complementary oligonucleotide is designed to 

30 prevent ribosomal binding to the TRANP-encoding transcript. 



IX. Expression of TRANP 
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Expression of TRANP is accomplished by subcloning the cDNA into an 
appropriate vector and transforming the vector into host cells. This vector contains an 
appropriate promoter, e.g., B-galactosidase upstream of the cloning site, operably 
associated with the cDNA of interest. (See, e.g.,Sambrook, supra , pp. 404-433; and 
5 Rosenberg, M. et al. (1983) Methods Enzymol. 101:123-138.) 

Induction of an isolated, transformed bacterial strain with isopropyl beta-D- 
thiogalactopyranoside (IPTG) using standard methods produces a fusion protein which 
consists of the first 8 residues of fl-galactosidase, about 5 to 15 residues of linker, and the 
foil length protein. The signal residues direct the secretion of TRANP into bacterial 
10 growth media which can be used directly in the following assay for activity. 

X. Demonstration of TRANP Activity 

TRANP transport activity can be demonstrated through the use of a ligand mixing 
assay that is used to measure transport from early to late endosomal compartments in 2L 

1 5 laeyis oocytes. Ovaries are dissected from adult female 2L laevis anesthetized with 
3-aminobenzoic acid ethyl ester (1 g/liter) in ice water, and oocytes are isolated. 
(Mukhopadhyay A, et al. (1997) J. Cell. Biol. 136(6): 1227-1237). Oocytes are pulsed 
with 2mg/ml avidin for 5hrs at 18° C, washed, then incubated for 16hrs to allow avidin to 
transport to a late compartment. The oocytes are then incubated with lmg/ml biotin- 

20 horseradish peroxidase (HRP) for 30 mins at 1 8° C to label early endocytic compartments. 
Varying amounts of TRANP are injected into the oocytes, which are incubated at 1 8° C. 
Oocytes are collected at several time points after TRANP injection, washed, and lysed in 
100^1 of phosphate-buffered saline containing 0.3% Triton X-100, 0.2% 
methylbenzethorium chloride, and 400 ng/ml of BSA-biotin as a scavenger. Finally, the 

25 lysates are centriftiged for 30 seconds in a microfuge, and the avidin-biotin complexes are 
immunoprecipitated using anti-avidin antibody-coated plates by incubation at 4 °C 
overnight. The plates are then washed at least 5 times to remove unbound proteins. 
Transport from the early endosomes to the late compartments is quantified by measuring 
the amount of immunoprecipitated HRP; increased transport due to TRANP is quantitated 

30 by comparison with control oocytes. 

XI. Production of TRANP Specific Antibodies 
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TRANP substantially purified using PAGE electrophoresis (see. e.g., Harrington, 
M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. The TRANP amino 
acid sequence is analyzed using DNASTAR software (DNASTAR Inc) to determine 
5 regions of high immunogenicity, and a corresponding oligopeptide is synthesized and used 
to raise antibodies by means known to those of skill in the art. Methods for selection of 
appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel et al. supra , ch. 11.) 

Typically, the oligopeptides are 1 5 residues in length, and are synthesized using 

10 an Applied Biosystems Peptide Synthesizer Model 43 1 A using fmoc-chemistry and 
coupled to KLH (Sigma, St. Louis. MO) by reaction with N-maleimidobenzoyl-N- 
hydroxysuccinimide ester (MBS) to increase immunogenicity. (See, e.g., Ausubel et al. 
supra .) Rabbits are immunized with the oligopeptide-KLH complex in complete Freund's 
adjuvant. Resulting antisera are tested for antipeptide activity, for example, by binding the 

15 peptide to plastic, blocking with 1% BSA, reacting with rabbit antisera, washing, and 
reacting with radio-iodinated goat anti-rabbit IgG. 

XII. Purification of Naturally Occurring TRANP Using Specific Antibodies 
Naturally occurring or recombinant TRANP is substantially purified by 

20 immunoaffinity chromatography using antibodies specific for TRANP. An 

immunoaffinity column is constructed by covalently coupling anti-TRANP antibody to an 
activated chromatographic resin, such as CNBr-activated Sepharose (Pharmacia & 
Upjohn). After the coupling, the resin is blocked and washed according to the 
manufacturer's instructions. 

25 Media containing TRANP are passed over the immunoaffinity column, and the 

column is washed under conditions that allow the preferential absorbance of TRANP (e.g., 
high ionic strength buffers in the presence of detergent). The column is eluted under 
conditions that disrupt antibody/TRANP binding (e.g., a buffer of pH 2 to pH 3, or a high 
concentration of a chaotrope, such as urea or thiocyanate ion), and TRANP is collected. 

30 

XIII. Identification of Molecules Which Interact with TRANP 

TRANP, or biologically active fragments thereof, are labeled with ,25 I 

-62- 



WO 99/41373 



PCT/US99/02527 



Bolton-Hunter reagent. (See, e.g., Bolton et al. (1973) Biochem. J. 133:529.) Candidate 
molecules previously arrayed in the wells of a multi-well plate are incubated with the 
labeled TRANP, washed, and any wells with labeled TRANP complex are assayed. Data 
obtained using different concentrations of TRANP are used to calculate values for the 
5 number, affinity, and association of TRANP with the candidate molecules. 

Various modifications and variations of the described methods and systems of the 
invention will be apparent to those skilled in the art without departing from the scope and 
spirit of the invention. Although the invention has been described in connection with 
specific preferred embodiments, it should be understood that the invention as claimed 
10 should not be unduly limited to such specific embodiments. Indeed, various modifications 
of the described modes for carrying out the invention which are obvious to those skilled in 
molecular biology or related fields are intended to be within the scope of the following 
claims. 
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What is claimed is: 

1 . A substantially purified polypeptide comprising an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ 

5 ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, and SEQ ID NO:9, 
and fragments thereof. 

2. A substantially purified variant having at least 90% amino acid identity to 
the amino acid sequence of claim 1. 

10 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 

1. 

4. An isolated and purified polynucleotide variant having at least 90% 
1 5 polynucleotide identity to the polynucleotide sequence of claim 3. 

5. An isolated and purified polynucleotide which hybridizes under stringent 
conditions to the polynucleotide sequence of claim 3. 

20 6. An isolated and purified polynucleotide which is complementary to the 

polynucleotide of claim 3. . 

7. An isolated and purified polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 

25 NO:12 ; SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
and SEQ ID NO: 18, and fragments thereof. 

8. An isolated and purified polynucleotide variant having at least 90% 
polynucleotide identity to the polynucleotide sequence of claim 7. 

30 

9. An isolated and purified polynucleotide which is complementary to the 
polynucleotide of claim 7. 
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10. An expression vector containing at least a fragment of the polynucleotide of 
claim 3. 

11. A host cell containing the expression vector of claim 10. 

5 

1 2. A method for producing a polypeptide comprising the amino acid sequence 
of SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ 
ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, and SEQ ID NO: 18, or fragments thereof, the 
method comprising the steps of: 

10 a) culturing the host cell of claim 1 1 under conditions suitable for the 

expression of the polypeptide; and 

b) recovering the polypeptide from the host cell culture. 

13. A pharmaceutical composition comprising the polypeptide of claim 1 in 
1 5 conj unction with a suitable pharmaceutical carrier. 

14. A purified antibody which specifically binds to the polypeptide of claim I. 

15. A purified agonist of the polypeptide of claim 1 . 

20 

1 6. A purified antagonist of the polypeptide of claim 1 . 

1 7. A method for treating or preventing a transport disorder, the method 
comprising administering to a subject in need of such treatment an effective amount of the 

25 pharmaceutical composition of claim 13. 

18. A method for treating or preventing cancer, the method comprising 
administering to a subject in need of such treatment an effective amount of the antagonist 
of claim 16. 

30 

19. A method for treating or preventing a transport disorder, the method 
comprising administering to a subject in need of such treatment an effective amount of the 
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antagonist of claim 16. 

20. A method for detecting a polynucleotide encoding a polypeptide having the 
amino acid sequence selected from the group SEQ ID NO: 10, SEQ ID NO:l 1, SEQ ID 
5 NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, 
and SEQ ID NO: 18, and fragments thereof, in a biological sample containing nucleic 
acids, the method comprising the steps of: 

(a) hybridizing the polynucleotide of claim 6 to at least one of the 
nucleic acids in the biological sample, thereby forming a hybridization complex; 

10 and 

(b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of a polynucleotide encoding 
the polypeptide in the biological sample. 

15 21. The method of claim 20 wherein the nucleic acids of the biological sample 

are amplified by the polymerase chain reaction prior to hybridization. 
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SEQUENCE LISTING 



<110> INCYTE PHARMACEUTICALS. INC. 
AU- YOUNG, Janice 
HILLMAN, Jennifer L. 
LAL, Preeti 
GUEGLER, Karl J. 
CORLEY, Neil C. 
YUE, Henry 
SANDMAN, Olga 
BAUGHN, Mariah R. 

<120> HUMAN TRANSPORT-ASSOCIATED MOLECULES 

<140> To Be Assigned 
<141> Herewith 

<150> 09/021,764 
<151> 1998-02-11 

<160> 18 

<170> PERL PROGRAM 

<210> 1 
<211> 383 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 144861 

<400> 1 



Met lie Leu Asn 


Asp Leu 


Leu 


Gly Arg 


Lys 


Leu Ser He Met Phe 


1 


5 






10 


15 


Ser Ala Val Pro 


Ser Ala 


Ala 


Gly Tyr 


Ala 


Leu Met Ala Gly Ala 




20 






25 


30 


His Gly Leu Trp 


Met Leu 


Leu 


Leu Gly 


Arg 


Thr Leu Thr Gly Phe 




35 






40 


45 


Ala Gly Gly Leu 


Thr Ala 


Ala 


Cys He 


Pro 


Val Tyr Val Ser Glu 




50 






55 


60 


lie Ala Pro Pro 


Gly Val 


Arg 


Gly Ala 


Leu 


Gly Ala Thr Pro Gin 




65 






70 


75 


Leu Met Ala Val 


Phe Gly 


Ser 


Leu Ser 


Leu 


Tyr Ala Leu Gly Leu 




80 






85 


90 


Leu Leu Pro Trp 


Arg Trp 


Leu 


Ala Val 


Ala 


Gly Glu Ala Pro Val 




95 






100 


105 


Leu lie Met He 


Leu Leu 


Leu 


Ser Phe 


Met 


Pro Asn Ser Pro Arg 




110 






115 


.120 


Phe Leu Leu Ser 


Arg Gly 


Arg 


Asp Glu 


Glu 


Ala Leu Arg Ala Leu 




125 






130 


135 


Ala Trp Leu Arg 


Gly Thr 


Asp 


Val Asp 


Val 


His Trp Glu Phe Glu 




140 






145 


150 


Gin He Gin Asp 


Asn Val 


Arg 


Arg Gin 


Ser 


Ser Arg Val Ser Trp 




155 






160 


165 


Ala Glu Ala Arg 


Ala Pro 


His 


Val Cys 


Arg 


Pro He Thr Val Ala 




170 






175 


180 
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Leu Leu 
Leu Val 
Pro Pro 
Ser Val 
Val Leu 
His Val 
Pro Gin 
Val Leu 
Leu Thr 
Arg Cys 
Gly Trp 
Ser Ser 
Pro Glu 
Val His 



Met Arg Leu 
185 

Tyr Leu Gin 
200 

Lys Asp Asp 
215 

Leu lie Ala 
230 

Leu Phe Val 
245 

Cys Cys Gin 
260 

Ala Ser Glu 
275 

Gly Gly Leu 
290 

Leu Val Pro 
305 

Gly Gly Gly 
320 

Val Met Cys 
335 

Ala Glu Thr 
350 

Pro Ala Ser 
365 

His Leu Ala 
380 



Leu Gin 
Ser He 
Ala Ala 
Ala Leu 
Ser Gly 
Pro Asp 
Pro Gin 
Gly Ala 
Leu Leu 
Ser Glu 
Leu Gly 
Asp Thr 
Ser Ser 
Ala His 



Gin Leu Thr 
190 

Phe Asp Ser 
205 

He Val Gly 
220 

Thr Met Asp 
235 

His Leu Ser 
250 

Ser Gly Ala 
265 

Gin His Cys 
280 

Ala Leu Ala 
295 

Ala Thr Met 
310 

Gly Gin Ala 
325 

Trp Leu Glu 
340 

Ala Ala Ser 
355 

Glu Gin Ala 
370 

Val 



Gly He 
Thr Ala 
Ala Val 
Leu Ala 
Val His 
Val His 
Gly Pro 
Ala Pro 
Leu Phe 
Val Leu 
Arg Gly 
Ser His 
Thr Pro 



Thr Pro He 
195 

Val Leu Leu 
210 

Arg Leu Leu 
225 

Gly Arg Lys 
240 

Ser Gly His 
255 

Pro Leu Trp 
270 

Gly Lys Arg 
285 

Ala Gly Tyr 
300 

He Met Gly 
315 

Gly Val Lys 
330 

Val Ser Ser 
345 

Pro Thr Gly 
360 

Trp Ala Gly 
375 



<210> 2 
<211> 272 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 607812 

<400> 2 

Met Ala Ser Ser Asn Thr Val Leu 

1 5 
Tyr Ser He Ala Gin Lys Ala Gly 
20 

Ala Glu Gly Asp Leu Gly He Val 
35 

Leu Gin Thr Lys Ala Asp Arg Leu 
50 

Ser Leu Ala Arg Lys Phe Pro Lys 
65 

Leu Val Val Trp Val Asp Pro Leu 
80 

Glu Gly Leu Leu Asp Asn Val Thr 
95 

Glu Gly Lys Ala He Ala Gly Val 
110 

Tyr Glu Ala Gly Pro Asp Ala Val 



Met Arg Leu Val Ala 


Ser Ala 


10 


15 


Met He Val Arg Arg 


Val He 


25 


30 


Glu Lys Thr Cys Ala 


Thr Asp 


40 


45 


Ala Gin Met Ser He 


Cys Ser 


55 


60 


Leu Thr He He Gly 


Glu Glu 


70 


75 


Asp Gly Thr Lys Glu 


Tyr Thr 


85 


90 


Val Leu He Gly He 


Ala Tyr 


100 


105 


He Asn Gin Pro Tyr 


Tyr Asn 


115 


120 


Leu Gly Arg Thr He 


Trp Gly 
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Val Leu 
Ala Gly 
Leu Val 
Arg Val 
Lys Ala 
Asp Thr 
Leu Thr 
Lys His 
Asp Tyr 
Val Pro 



Gly Leu 
Lys His 
Thr Asp 
Gly Gly 
Ser Ala 
Cys Ala 
Asp lie 
Met Asn 
Tyr Ala 



125 

Gly Ala 
140 

He He 
155 

Cys Val 
170 

Ala Gly 
185 

Tyr Val 
200 

Pro Glu 
215 

His Gly 
230 

Ser Ala 
245 

Ser Arg 
260 



Phe Gly 
Thr Thr 
Ala Ala 
Asn Lys 
Phe Ala 
Val He 
Asn Val 
Gly Val 
Val Pro 



130 
Phe Gin 

145 
Thr Arg 

160 
Met Asn 

175 
He He 

190 
Ser Pro 

205 
Leu His 

220 
Leu Gin 

235 
Leu Ala 

250 
Glu Ser 

265 



Leu Lys 
Ser His 
Pro Asp 
Gin Leu 
Gly Cys 
Ala Val 
Tyr His 
Thr Leu 
He Lys 



135 

Glu Val Pro 
150 

Ser Asn Lys 
165 

Ala Val Leu 
180 

He Glu Gly 
195 

Lys Lys Trp 
210 

Gly Gly Lys 
225 

Lys Asp Val 
240 

Arg Asn Tyr 
255 

Asn Ala Leu 
270 



<210> 3 
<211> 210 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 1259384 

<400> 3 

Met Gin Arg Pro Glu Ala Trp Pro Arg Pro His Pro Gly Glu Gly 

15 10 15 

Ala Ala Ala Ala Gin Ala Gly Gly Pro Ala Pro Pro Ala Arg Ala 

20 25 30 

Gly Glu Pro Ser Gly Leu Arg Leu Gin Glu Pro Ser Leu Tyr Thr 

35 40 45 

He Lys Ala Val Phe He Leu Asp Asn Asp Gly Arg Arg Leu Leu 

50 55 60 

Ala Lys Tyr Tyr Asp Asp Thr Phe Pro Ser Met Lys Glu Gin Met 

65 70 75 

Val Phe Glu Lys Asn Val Phe Asn Lys Thr Ser Arg Thr Glu Ser 

80 85 90 

Glu He Ala Phe Phe Gly Gly Met Thr He Val Tyr Lys Asn Ser 

95 100 105 

He Asp Leu Phe Leu Tyr Val Val Gly Ser Ser Tyr Glu Asn Glu 

110 115 120 

Leu Met Leu Met Ser Val Leu Thr Cys Leu Phe Glu Ser Leu Asn 

125 130 135 

His Met Leu Arg Lys Asn Val Glu Lys Arg Trp Leu Leu Glu Asn 

140 145 150 

Met Asp Gly Ala Phe Leu Val Leu Asp Glu He Val Asp Gly Gly 

155 160 165 

Val He Leu Glu Ser Asp Pro Gin Gin Val He Gin Lys Val Asn 

170 175 180 

Phe Arg Ala Asp Asp Gly Gly Leu Thr Glu Gin Ser Val Ala Gin 
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185 



190 



195 



Val Leu Gin Ser Ala Lys Glu Gin lie Lys Trp Ser Leu Leu Lys 



<210> 4 

<211> 465 

<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 1340813 

<400> 4 

Met Gly Gly Ala Val Val Asp Glu Gly Pro Thr Gly Val Lys Ala 
15 10 15 

Pro Asp Gly Gly Trp Gly Trp Ala Val Leu Phe Gly Cys Phe Val 

20 25 30 

lie Thr Gly Phe Ser Tyr Ala Phe Pro Lys Ala Val Ser Val Phe 
35 40 45 

Phe Lys Glu Leu He Gin Glu Phe Gly He Gly Tyr Ser Asp Thr 
50 55 6 0 

Ala Trp He Ser Ser He Leu Leu Ala Met Leu Tyr Gly Thr Gly 
65 70 75 

Pro Leu Cys Ser Val Cys Val Asn Arg Phe Gly Cys Arg Pro Val 
80 85 90 

Met Leu Val Gly Gly Leu Phe Ala Ser Leu Gly Met Val Ala Ala 
95 100 105 

Ser Phe Cys Arg Ser He He Gin Val Tyr Leu Thr Thr Gly Val 

110 H5 120 

He Thr Gly Leu Gly Leu Ala Leu Asn Phe Gin Pro Ser Leu He 

125 130 135 

Met Leu Asn Arg Tyr Phe Ser Lys Arg Arg Pro Met Ala Asn Gly 

140 145 150 

Leu Ala Ala Ala Gly Ser Pro Val Phe Leu Cys Ala Leu Ser Pro 

155 160 165 

Leu Gly Gin Leu Leu Gin Asp Arg Tyr Gly Trp Arg Gly Gly Phe 

170 175 180 

Leu He Leu Gly Gly Leu Leu Leu Asn Cys Cys Val Cys Ala Ala 

185 190 195 

Leu Met Arg Pro Leu Val Val Thr Ala Gin Pro Gly Ser Gly Pro 

200 205 210 

Pro Arg Pro Ser Arg Arg Leu Leu Asp Leu Ser Val Phe Arg Asp 

215 220 225 

Arg Gly Phe Val Leu Tyr Ala Val Ala Ala Ser Val Met Val Leu 

230 235 240 

Gly Leu Phe Val Pro Pro Val Phe Val Val Ser Tyr Ala Lys Asp 

245 250 255 

Leu Gly Val Pro Asp Thr Lys Ala Ala Phe Leu Leu Thr He Leu 

260 265 270 

Gly Phe He Asp He Phe Ala Arg Pro Ala Ala Gly Phe Val Ala 

275 280 285 

Gly Leu Gly Lys Val Arg Pro Tyr Ser Val Tyr Leu Phe Ser Phe 

290 295 300 

Ser Met Phe Phe Asn Gly Leu Ala Asp Leu Ala Gly Ser Thr Ala 

305 310 315 

Gly Asp Tyr Gly Gly Leu Val Val Phe Cys He Phe Phe Gly He 



200 



205 



210 



320 



325 



330 
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Ser Tyr Gly 
lie Val Gly 
Leu Met Glu 
Lys Leu Leu 
Ala Gly Ala 
Asn Phe Phe 
Val Ala Ala 
Ser Gly Val 
Pro Glu Lys 



Met Val 

335 
Thr His 

350 
Ala Val 

365 
Asp Ala 

380 
Glu Val 

395 
Cys He 

410 
Val Glu 

425 
Asp Leu 

440 
Asn Gly 

455 



Gly Ala 
Lys Phe 
Ala Val 
Thr His 
Leu Thr 
Arg Lys 
Glu Glu 
Arg Glu 
Glu Val 



Leu Gin Phe 
340 

Ser Ser Ala 
355 

Leu Val Gly 
370 

Val Tyr Met 
385 

Ser Ser Leu 
400 

Lys Pro Lys 
415 

Lys Leu His 
430 

Val Glu His 
445 

Val His Thr 
460 



Glu Val 
He Gly 
Pro Pro 
Tyr Val 
He Leu 
Glu Pro 
Lys Pro 
Phe Leu 
Pro Glu 



Leu Met Ala 
345 

Leu Val Leu 
360 

Ser Gly Gly 
375 

Phe He Leu 
390 

Leu Leu Gly 
405 

Gin Pro Glu 
420 

Pro Ala Asp 
435 

Lys Ala Glu 
450 

Thr Ser Val 
465 



<210> 5 
<211> 237 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 1689731 

<400> 5 

Met Leu Glu Glu Asp Met Glu Val Ala He Lys Met Val Val Val 
15 10 15 

Gly Asn Gly Ala Val Gly Lys Ser Ser Met He Gin Arg Tyr Cys 
20 25 30 

Lys Gly He Phe Thr Lys Asp Tyr Lys Lys Thr He Gly Val Asp 
35 40 45 

Phe Leu Glu Arg Gin He Gin Val Asn Asp Glu Asp Val Arg Leu 
50 55 60 

Met Leu Trp Asp Thr Ala Gly Gin Glu Glu Phe Asp Ala He Thr 
65 70 75 

Lys Ala Tyr Tyr Arg Gly Ala Gin Ala Cys Val Leu Val Phe Ser 
80 B5 90 

Thr Thr Asp Arg Glu Ser Phe Glu Ala Val Ser Ser Trp Arg Glu 
95 100 105 

Lys Val Val Ala Glu Val Gly Asp He Pro Thr Val Leu Val Gin 
110 H5 120 

Asn Lys He Asp Leu Leu Asp Asp Ser Cys He Lys Asn Glu Glu 
125 130 135 

Ala Glu Ala Leu Ala Lys Arg Leu Lys Leu Arg Phe Tyr Arg Thr 
140 145 150 

Ser Val Lys Glu Asp Leu Asn Val Asn Glu Val Phe Lys Tyr Leu 
155 160 165 

Ala Glu Lys Tyr Leu Gin Lys Leu Lys Gin Gin He Ala Glu Asp 
170 175 180 

Pro Glu Leu Thr His Ser Ser Ser Asn Lys He Gly Val Phe Asn 
185 190 195 

Thr Ser Gly Gly Ser His Ser Gly Gin Asn Ser Gly Thr Leu Asn 
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200 205 210 

Gly Gly Asp Val He Asn Leu Arg Pro Asn Lys Gin Arg Thr Lys 

21S 220 225 

Lys Asn Arg Asn Pro Phe Ser Ser Cys Ser He Pro 

230 235 



<210> 6 








<211> 208 








<212> PRT 








<213> Homo sapiens 














^ »^ -j -j _ OTCI Tift 

<223 > 2751 / J U 








<400> 6 








Met Ser Ala Arg 


\j±. y Mop 


Phe Gly Asn Pro Leu Arg Lys Phe 


Lys 


1 


c 
o 


10 


15 


Lieu vai Fne L»eu 


uiy uiu 


Gin Ser Val Gly Lys Thr Ser Leu 


He 






25 


30 


Thr Arg Phe Met 


Tyr Asp 


Ser Phe Asp Asn Thr Tyr Gin Ala 


Thr 


35 


40 


45 


He Gly He Asp 


Phe Leu 


Ser Lys Thr Met Tyr Leu Glu Asp 


Arg 


50 


55 


60 


Thr Val Arg Leu 


Gin Leu 


Trp Asp Thr Ala Gly Gin Glu Arg 


Phe 




65 


70 


75 


Arg Ser Leu He 


Pro Ser 


Tyr He Arg Asp Ser Thr Val Ala 


Val 


80 


85 


90 


Val Val Tyr Asp 


He Thr 


Asn Leu Asn Ser Phe Gin Gin Thr 


Ser 




95 


100 ■ 


105 


Lys Trp He Asp 


Asp Val 


Arg Thr Glu Arg Gly Ser Asp Val 


He 


110 


115 


120 


He Met Leu Val 


Gly Asn 


Lys Thr Asp Leu Ala Asp Lys Arg 


Gin 




125 


130 


135 


He Thr He Glu 


Glu Gly 


Glu Gin Arg Ala Lys Glu Leu Ser 


Val 




140 


145 


150 


Met Phe He Glu 


Thr Ser 


Ala Lys Thr Gly Tyr Asn Val Lys 


Gin 




155 


160 


165 


Leu Phe Arg Arg 


Val Ala 


Ser Ala Leu Pro Gly Met Glu Asn 


Val 




170 


175 


180 


Gin Glu Lys Ser 


Lys Glu 


Gly Met He Asp lie Lys Leu Asp 


Lys 


185 


190 


195 


Pro Gin Glu Pro 


Pro Ala 


Ser Glu Gly Gly Cys Ser Cys 






200 


205 





<210> 7 
<211> 709 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 2794975 
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<400> 7 

Met Ala Thr Cys Ala Glu lie Leu Arg Ser Glu Phe Pro Glu lie 
15 10 15 

Asp Gly Gin Val Phe Asp Tyr Val Thr Gly Val Leu His Ser Gly 
20 25 30 

Ser Ala Asp Phe Glu Ser Val Asp Asp Leu Val Glu Ala Val Gly 
35 40 45 

Glu Leu Leu Gin Glu Val Ser Gly Asp Ser Lys Asp Asp Ala Gly 
50 55 60 

lie Arg Ala Val Cys Gin Arg Met Tyr Asn Thr Leu Arg Leu Ala 
65 70 75 

Glu Pro Gin Ser Gin Gly Asn Ser Gin Val Leu Leu Asp Ala Pro 
80 85 90 

lie Gin Leu Ser Lys lie Thr Glu Asn Tyr Asp Cys Gly Thr Lys 
95 100 105 

Leu Pro Gly Leu Leu Lys Arg Glu Gin Ser Ser Thr Val Asn Ala 

110 115 120 

Lys Lys Leu Glu Lys Ala Glu Ala Arg Leu Lys Ala Lys Gin Glu 

125 130 135 

Lys Arg Ser Glu Lys Asp Thr Leu Lys Thr Ser Asn Pro Leu Val 

140 145 150 

Leu Glu Glu Ala Ser Ala Ser Gin Ala Gly Ser Arg Lys Glu Ser 

155 160 165 

Arg Leu Glu Ser Ser Gly Lys Asn Lys Ser Tyr Asp Val Arg lie 

170 175 180 

Glu Asn Phe Asp Val Ser Phe Gly Asp Arg Val Leu Leu Ala Gly 

185 190 195 

Ala Asp Val Asn Leu Ala Trp Gly Arg Arg Tyr Gly Leu Val Gly 

200 205 210 

Arg Asn Gly Leu Gly Lys Thr Thr Leu Leu Lys Met Leu Ala Thr 

215 220 225 

Arg Ser Leu Arg Val Pro Ala His lie Ser Leu Leu His Val Glu 

230 235 240 

Gin Glu Val Ala Gly Asp Asp Thr Pro Ala Leu Gin Ser Val Leu 

245 250 255 

Glu Ser Asp Ser Val Arg Glu Asp Leu Leu Arg Arg Glu Arg Glu 

260 265 270 

Leu Thr Ala Gin He Ala Ala Gly Arg Ala Glu Gly Ser Glu Ala 

275 280 285 

Ala Glu Leu Ala Glu He Tyr Ala Lys Leu Glu Glu He Glu Ala 

290 295 300 

Asp Lys Ala Pro Ala Arg Ala Ser Val He Leu Ala Gly Leu Gly 

305 310 315 

Phe Thr Pro Lys Met Gin Gin Gin Pro Thr Arg Glu Phe Ser Gly 

320 325 330 

Gly Trp Arg Met Arg Leu Ala Leu Ala Arg Ala Leu Phe Ala Arg 

335 340 345 

Pro Asp Leu Leu Leu Leu Asp Glu Pro Thr Asn Met Leu Asp Val 

350 355 360 

Arg Ala He Leu Trp Leu Glu Asn Tyr Leu Gin Thr Trp Pro Ser 

365 370 375 

Thr He Leu Val Val Ser His Asp Arg Asn Phe Leu Asn Ala He 

380 385 390 

Ala Thr Asp He He His Leu His Ser Gin Arg Leu Asp Gly Tyr 

395 400 405 

Arg Gly Asp Phe Glu Thr Phe He Lys Ser Lys Gin Glu Arg Leu 

410 415 420 

Leu Asn Gin Gin Arg Glu Tyr Glu Ala Gin Gin Gin Tyr Arg Gin 
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425 430 435 

His lie Gin Val Phe He Asp Arg Phe Arg Tyr Asn Ala Asn Arg 

440 445 450 

Ala Ser Gin Val Gin Ser Lys Leu Lys Met Leu Glu Lys Leu Pro 

455 460 465 

Glu Leu Lys Pro Val Asp Lys Glu Ser Glu Val Val Met Lys Phe 

470 475 480 

Pro Asp Gly Phe Glu Lys Phe Ser Pro Pro He Leu Gin Leu Asp 

485 490 495 

Glu Val Asp Phe Tyr Tyr Asp Pro Lys His Val He Phe Ser Arg 

500 505 510 

Leu Ser Val Ser Ala Asp Leu Glu Ser Arg He Cys Val Val Gly 

515 520 525 

Glu Asn Gly Ala Gly Lys Ser Thr Met Leu Lys Leu Leu Leu Gly 

530 535 540 

Asp Leu Ala Pro Val Arg Gly He Arg His Ala His Arg Asn Leu 

545 550 555 

Lys He Gly Tyr Phe Ser Gin His His Val Glu Gin Leu Asp Leu 

560 565 570 

Asn Val Ser Ala Val Glu Leu Leu Ala Arg Lys Phe Pro Gly Arg 

575 580 585 

Pro Glu Glu Glu Tyr Arg His Gin Leu Gly Arg Tyr Gly He Ser 

590 595 600 

Gly Glu Leu Ala Met Arg Pro Leu Ala Ser Leu Ser Gly Gly Gin 

605 610 615 

Lys Ser Arg Val Ala Phe Ala Gin Met Thr Met Pro Cys Pro Asn 

620 625 630 

Phe Tyr He Leu Asp Glu Pro Thr Asn His Leu Asp Met Glu Thr 

635 640 645 

He Glu Ala Leu Gly Arg Ala Leu Asn Asn Phe Arg Gly Gly Val 

650 655 660 

He Leu Val Ser His Asp Glu Arg Phe He Arg Leu Val Cys Arg 

665 670 675 

Glu Leu Trp Val Cys Glu Gly Gly Gly Val Thr Arg Val Glu Gly 

680 685 690 

Gly Phe Asp Gin Tyr Arg Ala Leu Leu Gin Glu Gin Phe Arg Arg 

695 700 705 

Glu Gly Phe Leu 



<210> 8 
<211> 962 
<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 2797710 

<400> 8 

Met Asn Phe Leu Arg Gly Val Met Gly Gly Gin Ser Ala Gly Pro 
15 10 15 

Gin His Thr Glu Ala Glu Thr He Gin Lys Leu Cys Asp Arg Val 

20 25 30 

Ala Ser Ser Thr Leu Leu Asp Asp Arg Arg Asn Ala Val Arg Ala 

35 40 45 

Leu Lys Ser Leu Ser Lys Lys Tyr Arg Leu Glu Val Gly He Gin 
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50 55 60 

Ala Met Glu His Leu He His Val Leu Gin Thr Asp Arg Ser Asp 
65 70 75 

Ser Glu He He Gly Tyr Ala Leu Asp Thr Leu Tyr Asn He He 
80 85 90 

Ser Asn Glu Glu Glu Glu Glu Val Glu Glu Asn Ser Thr Arg Gin 
95 100 105 

Ser Glu Asp Leu Gly Ser Gin Phe Thr Glu He Phe He Lys Gin 
110 115 I 20 

Gin Glu Asn Val Thr Leu Leu Leu Ser Leu Leu Glu Glu Phe Asp 
12 5 130 135 

Phe His Val Arg Trp Pro Gly Val Lys Leu Leu Thr Ser Leu Leu 
140 1*5 150 

Lys Gin Leu Gly Pro Gin Val Gin Gin He He Leu Val Ser Pro 
155 160 165 

Met Gly Val Ser Arg Leu Met Asp Leu Leu Ala Asp Ser Arg Glu 
170 175 180 

Val He Arg Asn Asp Gly Val Leu Leu Leu Gin Ala Leu Thr Arg 
185 190 195 

Ser Asn Gly Ala He Gin Lys He Val Ala Phe Glu Asn Ala Phe 
200 205 210 

Glu Arg Leu Leu Asp He He Ser Glu Glu Gly Asn Ser Asp Gly 
215 220 225 

Gly He Val Val Glu Asp Cys Leu He Leu Leu Gin Asn Leu Leu 
230 235 240 

Lys Asn Asn Asn Ser Asn Arg Asn Phe Phe Lys Glu Gly Ser Tyr 
245 250 255 

He Gin Arg Met Lys Pro Trp Phe Glu Val Gly Asp Glu Asn Ser 
260 265 270 

Glv Trp Ser Ala Gin Lys Val Thr Asn Leu His Leu Met Leu Gin 
275 280 285 

Leu Val Arg Val Leu Val Ser Pro Thr Asn Pro Pro Gly Ala Thr 
290 295 300 

Ser Ser Cys Gin Lys Ala Met Phe Gin Cys Gly Leu Leu Gin Gin 
305 310 315 

Leu Cys Thr He Leu Met Ala Thr Gly Val Pro Ala Asp He Leu 
320 325 330 

Thr Glu Thr He Asn Thr Val Ser Glu Val He Arg Gly Cys Gin 
335 340 345 

Val Asn Gin Asp Tyr Phe Ala Ser Val Asn Ala Pro Ser Asn Pro 
350 355 360 

Pro Arg Pro Ala He Val Val Leu Leu Met Ser Met Val Asn Glu 
365 370 375 

Arg Gin Pro Phe Val Leu Arg Cys Ala Val Leu Tyr Cys Phe Gin 
380 385 390 

Cys Phe Leu Tyr Lys Asn Gin Lys Gly Gin Gly Glu He Val Ser 
395 400 405 

Thr Leu Leu Pro Ser Thr He Asp Ala Thr Gly Asn Ser Val Ser 
410 415 420 

Ala Gly Gin Leu Leu Cys Gly Gly Leu Phe Ser Thr Asp Ser Leu 
425 430 435 

Ser Asn Trp Cys Ala Ala Val Ala Leu Ala His Ala Leu Gin Glu 
440 445 450 

Asn Ala Thr Gin Lys Glu Gin Leu Leu Arg Val Gin Leu Ala Thr 
455 460 465 

Ser He Gly Asn Pro Pro Val Ser Leu Leu Gin Gin Cys Thr Asn 
470 475 480 

He Leu Ser Gin Gly Ser Lys He Gin Thr Arg Val Gly Leu Leu 



9 



WO 99/41373 PCT/US99/02527 



485 490 495 

Met Leu Leu Cys Thr Trp Leu Ser Asn Cys Pro lie Ala Val Thr 

500 505 510 

His Phe Leu His Asn Ser Ala Asn Val Pro Phe Leu Thr Gly Gin 

515 520 525 

lie Ala Glu Asn Leu Gly Glu Glu Glu Gin Leu Val Gin Gly Leu 

530 535 540 

Cys Ala Leu Leu Leu Gly He Ser He Tyr Phe Asn Asp Asn Ser 

545 550 555 

Leu Glu Ser Tyr Met Lys Glu Lys Leu Lys Gin Leu He Glu Lys 

560 565 570 

Arg He Gly Lys Glu Asn Phe He Glu Lys Leu Gly Phe lie Ser 

575 580 585 

Lys His Glu Leu Tyr Ser Arg Ala Ser Gin Lys Pro Gin Pro Asn 

590 595 600 

Phe Pro Ser Pro Glu Tyr Met He Phe Asp His Glu Phe Thr Lys 

605 610 615 

Leu Val Lys Glu Leu Glu Gly Val He Thr Lys Ala He Tyr Lys 

620 625 630 

Ser Ser Glu Glu Asp Lys Lys Glu Glu Glu Val Lys Lys Thr Leu 

635 640 645 

Glu Gin His Asp Asn He Val Thr His Tyr Lys Asn Met He Arg 

650 655 660 

Glu Gin Asp Leu Gin Leu Glu Glu Leu Arg Gin Gin Val Ser Thr 

665 670 675 

Leu Lys Cys Gin Asn Glu Gin Leu Gin Thr Ala Val Thr Gin Gin 

680 685 690 

Val Ser Gin He Gin Gin His Lys Asp Gin Tyr Asn Leu Leu Lys 

695 700 705 

He Gin Leu Gly Lys Asp Asn Gin His Gin Gly Ser Tyr Ser Glu 

710 715 720 

Gly Ala Gin Met Asn Gly He Gin Pro Glu Glu He Gly Arg Leu 

725 730 735 

Arg Glu Glu He Glu Glu Leu Lys Arg Asn Gin Glu Leu Leu Gin 

740 745 750 

Ser Gin Leu Thr Glu Lys Asp Ser Met He Glu Asn Met Lys Ser 

755 760 765 

Ser Gin Thr Ser Gly Thr Asn Glu Gin Ser Ser Ala He Val Ser 

770 775 780 

Ala Arg Asp Ser Glu Gin Val Ala Glu Leu Lys Gin Glu Leu Ala 

785 790 795 

Thr Leu Lys Ser Gin Leu Asn Ser Gin Ser Val Glu He Thr Lys 

800 805 810 

Leu Gin Thr Glu Lys Gin Glu Leu Leu Gin Lys Thr Glu Ala Phe 

815 820 825 

Ala Lys Ser Val Glu Val Gin Gly Glu Thr Glu Thr He He Ala 

830 835 840 

Thr Lys Thr Thr Asp Val Glu Gly Arg Leu Ser Ala Leu Leu Gin 

845 850 855 

Glu Thr Lys Glu Leu Lys Asn Glu He Lys Ala Leu Ser Glu Glu 

860 865 870 

Arg Thr Ala He Lys Glu Gin Leu Asp Ser Ser Asn Ser Thr He 
875 880 885 

Ala He Leu Gin Thr Glu Lys Asp Lys Leu Glu Leu Glu He Thr 
890 895 900 

Asp Ser Lys Lys Glu Gin Asp Asp Leu Leu Val Leu Leu Ala Asp 
905 910 915 

Gin Asp Gin Lys He Leu Ser Leu Lys Asn Lys Leu Lys Asp Leu 



10, 



WO 99/41373 



PCT/US99/02527 



920 925 930 

Gly His Pro Val Glu Glu Glu Asp Glu Leu Glu Ser Gly Asp Gin 

935 940 945 

Glu Asp Glu Asp Asp Glu Ser Glu Asp Pro Gly Lys Asp Leu Asp 

950 955 960 

His He 



<210> 9 

<211> 368 

<212> PRT 

<213> Homo sapiens 

<220> - 
<223> 2914719 

<400> 9 

Met Ser Leu Phe Gly Thr Thr Ser Gly Phe Gly Thr Ser Gly Thr 
15 10 15 

Ser Met Phe Gly Ser Ala Thr Thr Asp Asn His Asn Pro Met Lys 
20 25 30 

Asp He Glu Val Thr Ser Ser Pro Asp Asp Ser He Gly Cys Leu 
35 40 45 

Ser Phe Ser Pro Pro Thr Leu Pro Gly Asn Phe Leu He Ala Gly 
50 55 60 

Ser Trp Ala Asn Asp Val Arg Cys Trp Glu Vi'l Gin Asp Ser Gly 
65 70 75 

Gin Thr He Pro Lys Ala Gin Gin Met His Thr Gly Pro Val Leu 
80 85 90 

Asp Val Cys Trp Ser Asp Asp Gly Ser Lys Val Phe Thr Ala Ser 
95 100 105 

Cys Asp Lys Thr Ala Lys Met Trp Asp Leu Ser Ser Asn Gin Ala 

110 115 120 

He Gin He Ala Gin His Asp Ala Pro Val Lys Thr He His Trp 

125 130 135 

He Lys Ala Pro Asn Tyr Ser Cys Val Met Thr Gly Ser Trp Asp 

140 145 150 

Lys Thr Leu Lys Phe Trp Asp Thr Arg Ser Ser Asn Pro Met Met 

155 160 165 

Val Leu Gin Leu Pro Glu Arg Cys Tyr Cys Ala Asp Val He Tyr 

170 175 180 

Pro Met Ala Val Val Ala Thr Ala Glu Arg Gly Leu He Val Tyr 

185 190 195 

Gin Leu Glu Asn Gin Pro Ser Glu Phe Arg Arg He Glu Ser Pro 

200 205 210 

Leu Lys His Gin His Arg Cys Val Ala He Phe Lys Asp Lys Gin 

215 220 225 

Asn Lys Pro Thr Gly Phe Ala Leu Gly Ser He Glu Gly Arg Val 

230 235 240 

Ala He His Tyr He Asn Pro Pro Asn Pro Ala Lys Asp Asn Phe 

245 250 255 

Thr Phe Lys Cys His Arg Ser Asn Gly Thr Asn Thr Ser Ala Pro 

260 265 270 

Gin Asp He Tyr Ala Val Asn Gly He Ala Phe His Pro Val His 

275 280 285 

Gly Thr Leu Ala Thr Val Gly Ser Asp Gly Arg Phe Ser Phe Trp 
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290 295 300 

Asp Lys Asp Ala Arg Thr Lys Leu Lys Thr Ser Glu Gin Leu Asp 

305 310 315 

Gin Pro lie Ser Ala Cys Cys Phe Asn His Asn Gly Asn He Phe 

320 325 330 

Ala Tyr Ala Ser Ser Tyr Asp Trp Ser Lys Gly His Glu Phe Tyr 

335 340 345 

Asn Pro Gin Lys Lys Asn Tyr He Phe Leu Arg Asn Ala Ala Glu 

350 355 360 

Glu Leu Lys Pro Arg Asn Lys Lys 

365 



<210> 10 

<211> 1527 

<212> DNA 

<213> Homo sapiens 

<220> - 
<223> 144861 

<400> 10 

ctcgccccga gagagacccg gccatgcagg 
acgacacctt ccccgagaag ccgcccccgt 
tgcagaacaa aagggtgttc ctggccacct 
ggtatgccct g.jtctacaca tcccctgtca 
acctgcatct gaccaaatcc caggcatcct 
cggccggagg ctgagtgcca tgatcctcaa 
gttctcagct gtgccgtcgg cggccggcta 
gatgctgctg ctcggaagga cgctgacggg 
cccggtgtac gtgtctgaga ttgctccccc 
ccagctcatg gcagtgttcg gatccctgtc 
gcgctggctg gctgtggccg gggaggcgcc 
catgcccaac tcgccgcgct tcctgctctc 
gctggcctgg ctgcgtggga cggacgtcga 
caacgtccgg agacagagca gccgagtatc 
ccggcccatc accgtggcct tgctgatgcg 
catcctggtc tacctgcagt ccatcttcga 
cgacgcagcc atcgttgggg ccgtgcggct 
ggacctcgca ggccgcaagg tgctgctctt 
ccatcatgtt tgctgccaac ctgactctgg 
tgagccccaa cagcactgcg ggcctggaaa 
tgcagcaccc gctggctatc tcaccctggt 
gggtaggtgt ggtggtggct cagaggggca 
gtgtctgggg tggctggaga ggggggtctc 
ctccagtcac cccacagggc ctgaacctgc 
tggggtccat cacctggctg ctcatgtctg 
caaggcctac gccgtgggct ggggtcc 



agccgctgct gggagccgag ggcccggact 60 
cgccagggga cagggcgcgg gtcgggaccc 120 
tcgccgcagt gctcggcaat ttcagctttg 180 
tcccagccct ggagcgctcc ttggatcctg 240 
ggtttgggtc cgtgttcacc ctgggagcag 300 
cgacctcctg ggccggaagc tgagcatcat 360 
tgcgctcatg gcgggtgcgc acggcctctg 420 
cttcgccggg ggtctcacag ctgcctgcat 480 
aggcgttcgt ggggctctgg gggccacacc 540 
cctctacgcc cttggcctcc tgctgccgtg 600 
tgtgctcatc atgatcctgc tgctcagctt 660 
tcggggcagg gacgaagagg ccctgcgggc 720 
tgtccactgg gagttcgagc agatccagga 780 
gtgggctgag gcacgggccc cccacgtgtg 840 
cctcctgcag cagctgacgg gcatcacgcc 900 
cagcaccgct gtcctgctgc cccccaagga 960 
cctgtccgtg ctgatcgccg ccctcaccat 1020 
cgtctcaggc cacctgtctg tccacagcgg 1080 
ggctgtacat ccactttggc cccaggcctc 1140 
gcgagtcctg gggggacttg gcgcagccct 1200 
gcccctgctg gccaccatgc tcttcatcat 1260 
ggctgtcttg ggtgttaagg gatgggtgat 1320 
cagcagctca gcggagacag acacagccgc 1380 
ctcctcctcc gagcaggcta cgccgtgggc 1440 
aggtcctgcc ctgcgtgccg tggcgtggct 1500 

.,; 1527 



<210> 11 

<211> 1270 

<212> DNA 

<213> Homo sapiens 

<220> - 
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<223> 607812 



<400> 11 

ggatgccatc cttctcaaaa gacttattga 
agggacctgg gtctacgata acgcgctttt 
tgttccagag tgtaccatgg cttccagtaa 
atattctatt gctcaaaagg caggaatgat 
gggtattgtg gagaagacct gtgcaacaga 
gatgagcata tgttcttcat tggcccggaa 
gctcgtggtc tgggttgatc ctctggatgg 
caatgtaaca gttcttattg gaattgctta 
ccagccatat tacaactatg aggcaggacc 
agttttaggt ttaggcgcct ttgggtttca 
tatcacaact actcgatccc atagcaacaa 
ccccgatgct gtgctgcgag taggaggagc 
caaagcctct gcttatgtat ttgcaagtcc 
agaagttatt ttacatgctg tgggaggcaa 
gtaccacaag gatgtgaagc atatgaactc 
tgactactat gcaagccgag ttccagaatc 
agtttcattt ggccgggcgc ggtggctcat 
ggcaggtgga tcacttgagc tcgggagttt 
catctctaca aaaatacaaa ttaactgggc 
ttgagaggct gaagcagaag aatctcttgg 
gatcgtgcca ctgcactcca gcctgagtga 
tataaaaggg 



cagtgccaaa gctcggtact ggacacaacg 60 
gctcctcctg aagtgtcttt ggtccaacgt 120 
cactgtgttg atgcggttgg tagcctccgc 180 
agtcagacgt gttattgctg aaggagacct 240 
cctgcagacc aaagccgacc gattggcaca 300 
attccccaaa ctcacaatta taggggaaga 360 
aaccaaggaa tataccgaag gtcttcttga 420 
tgaaggaaaa gccatagcag gagttattaa 480 
agatgctgtg ttggggagga caatctgggg 540 
gctgaaagaa gtccctgctg ggaaacacat 600 
gttggttact gactgtgttg ctgctatgaa 660 
aggaaataag attattcagc tgattgaagg 720 
tggttgtaag aagtgggata cttgtgctcc 780 
gttaaccgat atccatggga atgttcttca 840 
tgcaggagtc ctggccacac tgaggaatta 900 
tattaaaaat gcacttgttc cttaaaggaa 960 
gcctgtagtc ccggcacttt gggaggccga 1020 
gagaccagcc tgggcaatat cgtgagaccc 1080 
atcctgtcat gcgcctgtcg tcccagctac 114 0 
gcccgggagg cggaggttgc agtgagctga 1200 
caggagttaa gccctgtctc agaaatagaa 1260 

1270 



<210> 12 
<211> 902 
<212> DNA 

<213> Homo sapiens 

<220> - 
<223> 1259384 

<400> 12 

gcggaatgca gcggcccgag gcctggccac 
cccaggccgg gggcccggcg ccgcctgctc 
aggaaccttc cctctacacc atcaaggctg 
tgctggccaa gtattatgat gacacattcc 
aaaatgtctt caacaagacc agccggactg 
ccatcgtcta caagaacagc attgacctct 
atgagctgat gctcatgtct gttctcacct 
ggaagaacgt ggagaagcgc tggttgctgg 
acgagattgt ggatggcggt gtgattctgg 
tgaattttag ggcagatgat ggcggcttga 
ctgccaagga acaaattaaa tggtcgttat 
tgccccccag atcatttccc caatcctggc 
acccctctgt atccccaggt ccctcccaga 
ctgagatgca aaggtttggc ctcaggagag 
catatcttag gcattcctgg ccccagggcc 
aa 



gtccgcaccc gggggagggg gccgcggcgg 60 
gagccgggga gccctcgggg ctgcggttgc 120 
ttttcatcct agataatgac gggcgccggc 180 
cctccatgaa ggagcagatg gttttcgaga 240 
agagtgagat tgcatttttt gggggtatga 300 
tcctatacgt ggtgggctca tcctacgaga 360 
gcctgtttga gtctctgaac cacatgttaa 420 
agaacatgga cggagccttc ttggtgctgg 480 
agagtgaccc ccagcaagtg atccagaagg 540 
ctgaacagag tgtggcccag gttcttcagt 600 
tgaaatgaag gctgtggatt caaggctccc 660 
aaaagcccaa agatcccagg gtcaggagag 720 
actgactcct aaggtctcca gccagggctt 780 
tcaccttttc tcacggccct ggccttaact 840 
ctaataaacc tgcttttgtc ttctgcaaat 900 

902 



<210> 13 

<211> 2026 

<212> DNA 

<213> Homo sapiens 
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<220> - 
<223> 1340813 



<400> 13 

gcagaggcgg cgagaggcgg gctgaggcgg 
cctcctggcc atgggagggg ccgtggtgga 
cggcggctgg ggctgggccg tgctcttcgg 
cttccccaag gccgtcagtg tcttcttcaa 
cagcgacaca gcctggatct cctccatcct 
ctgcagtgtg tgcgtgaacc gctttggctg 
tgcgtcgctg ggcatggtgg ctgcgtcctt 
cactggggtc atcacggggt tgggtttggc 
gaaccgctac ttcagcaagc ggcgccccat 
tgtcttcctg tgtgccctga gcccgctggg 
gggcggcttc ctcatcctgg gcggcctgct 
gaggcccctg gtggtcacgg cccagccggg 
gctagacctg agcgtcttcc gggaccgcgg 
catggtgctg gggctcttcg tcccgcccgt 
cgtgcccgac accaaggccg ccttcctgct 
gcggccggcc gcgggcttcg tggcggggct 
cttcagcttc tccatgttct tcaacggcct 
ctacggcggc ctcgtggtct tctgcatctt 
cctgcagttc gaggtgctca tggccatcgt 
cctggtgctg ccgatggagg cggtggccgt 
cctggatgcg acccacgtct acatgtacgt 
ctcctccctg attttgctgc tgggcaactt 
acagcctgag gtggcggccg tggaggagga 
ggtggacttg cgggaggtgg agcatttcct 
ggttcacacc ccggaaacaa gtgtctgagt 
aggtacagaa gccggcaacg cttgctattt 
ccacggctgg gctccagctg ccggcccagc 
» gaaggtggcg gggtgggaac cgtgtcattc 
aaggttacaa ggcatcctca ccaggggccc 
ctgctatgct caaggacctg gaaacccatg 
tgggtgggcc gcagacaggc tggcagggca 
ctaccctggg ctcacatggg gcctgtgccc 
ttccaccccc ggaagatgga aataaacctg 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 



cccagcggcg gcaggtgagg cggaaccaac 60 
cgagggcccc acaggcgtca aggcccctga 120 
ctgtttcgtc atcactggct tctcctacgc 180 
ggagctcata caggagtttg ggatcggcta 240 
gctggccatg ctctacggga caggtccgct 300 
ccggcccgtc atgcttgtgg ggggtctctt 360 
ttgccggagc atcatccagg tctacctcac 420 
actcaacttc cagccctcgc tcatcatgct 480 
ggccaacggg ctggcggcag caggtagccc 540 
gcagctgctg caggaccgct acggctggcg 600 
gctcaactgc tgcgtgtgtg ccgcactcat 660 
ctcggggccg ccgcgaccct cccggcgcct 720 
ctttgtgctt tacgccgtgg ccgcctcggt 780 
gttcgtggtg agctacgcca aggacctggg 840 
caccatcctg ggcttcattg acatcttcgc 900 
tgggaaggtg cggccctact ccgtctacct 960 
cgcggaccxg gcgggctcta cggcgggcga 1020 
ctttggcatc tcctacggca tggtgggggc 1080 
gggcacccac aagttctcca gtgccattgg 1140 
gctcgtcggg cccccttcgg gaggcaaact 1200 
gttcatcctg gcgggggccg aggcgctcac 1260 
cttctgcatt aggaagaagc ccaaagagcc 1320 
gaagctccac aagcctcctg cagactcggg 1380 
gaaggctgag cctgagaaaa acggt'gaggt 1440 
ggctgggcgg ggccggcagg cacagggagg 1500 
attttacaaa ctggactggc tcaggcaggg 1560 
ggatcgtcgc ccgatcagtg ttttgagggg 1620 
cagagtggat ctgcggtgaa gccaagccgc 1680 
cgcctgctgc tcccaggtgg cctgcggcca 1740 
cttcgagaca acgtgacttt aatgggaggg 1800 
ggtgctgcgt ggggccctct ccagcccgtc 1860 
acccctcttg agtgtcttgg ggacagctct 1920 
cgrgtgggtg gagtgttagg aaaaaaaaaa 1980 
aaaaaaaaaa aagtct 2026 



<210> 14 
<211> 2829 
<212> DNA 

<213> Homo sapiens 

<220> - 
<223> 1689731 

<400> 14 

ccgatcccaa gttccagcta gagaaggtgg 
gctcccagcg atttgagagg gaagatctga 
atttcagctc caaaaatgtt ggaggaagat 
gggaatggag cagttggaaa atcaagtatg 
aaagactaca agaaaaccat tggagttgat 
gaagatgtca gactaatgtt atgggacact 
aaggcctact atcgaggagc ccaggcttgt 
tcttttgaag cagtttccag ttggagagag 
actgtacttg tgcaaaacaa gattgatctt 



gcggcagcgc cagcccggcg ccccctccca 60 
tctcctagaa ttagagttgg tacagaaacc 120 
atggaagtcg ccataaagat ggtggttgta 180 
attcagcgat attgcaaagg catttttaca 240 
tttttggagc gacaaattca agttaatgat 300 
gcaggtcagg aggaatttga tgcaattaca 360 
gtgctcgtgt tctctaccac agatagggaa 420 
aaagtagtag ccgaagtggg agatatacca 480 
ctggatgatt cttgtataaa gaatgaggaa 540 
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gctgaggcac tggcaaaaag gttaaagtta 
ctaaatgtga atgaagtttt taagtatttg 
caaatagctg aggatccaga actaacgcat 
acatctggtg gaagtcactc cggtcagaat 
aatcttagac ccaacaaaca aaggaccaag 
ataccctaag atgttttggg aggaaaacaa 
cccatccagc tgtcatggta tgttacaagg 
ctgaaagtga cagacttaaa tattgctctg 
ctggtggaaa aattattgcc ctgtggactc 
ctcctgtttg taaaggaatg ctataagaaa 
tgctggaaaa cataaatgaa atgcctaaat 
aaggaaagat agcaagcatt cttttcctga 
tattgaatct gaatactttc acaaaataat 
ctgcattagt attttggcat tgaaacttgt 
gtgcatgcag atgtctggta ctggagtaag 
tacattctaa ttaattcact tataaaacca 
aaatgataaa ctctagacat atattttgca 
gccaaaaata tcatacctaa tttttatagt 
atatctttta aaagcagtaa gcagttttca 
gaatttgaat gctagagcat ctcccaggtg 
acccataagc atggtgagct cctgctctca 
cacagcacat ctgcagtcat tcaggaggtc 
tttgctgctc actgagaacc cagctttgaa 
tgcacaaaaa tattattatt agtcttaact 
tctaaaccat ccttttgttt acacattttt 
aacatttcta taaccttgtt tgtatagcac 
aatcatttct tttacaaaat gatttaatat 
acttgggaca gtagctttcc agctaa.;rata 
actcatcaat ggctttattt ttaacccata 
cagcattgca aaatctgtaa aaataacttt 
ttatttgtac taaacaatgt tttatgtact 
ctgtactccc ctcatcctcc taatgttgag 
aagtgatcaa aagtaaatgt tctctttgcc 
aagtaaattt ttgcttttca ttatattgaa 
gtaaaatgct tatttttatg tcaccagttg 
gttttggggg caaaattcag ttcctaacaa 
ctactctctt caattttcta ctctctactc 
ttgcaacaaa tagtcttttg tccttattgg 
ttagttctt 



agattctaca gaacatcagt gaaagaagat 600 
gctgaaaaat accttcagaa actcaaacaa 660 
tcaagtagta acaagattgg tgtctttaat 720 
tcaggtaccc tcaatggtgg agatgtcatc 780 
aaaaacagaa atccttttag cagctgtagc 840 
ttgaattaca ttgtgcaatg cattaagaaa 900 
ttttcagcag actttgctcc taatggctct 960 
cagtgctttt cagaatgtag agtgagacct 1020 
ctttaatttt ttttacatta gacgaagctt 1080 
tttcaaacac aggttttgcg gaattttaaa 1140 
atattgttac agattttaat actgtggatc 1200 
aattggtcat gtagcttttc tgcgaaatag 1260 
actgaaaggt aaaccctaaa tcttgccaga 1320 
gcaatatctg tgtaatgtag aatgtggggt 1380 
ctgaaggtat gtgcagtatt taagaaaatg 1440 
tttacatatt gtgtatatgt ttttattttc 1500 
aaatgagctt gggtttaaaa agtgctttgt 1560 
gctttactat cagaatttat attgtttatt 1620 
tctgacagcg tagataaaag aggagtatag 1680 
agccttattg cagatctttt tgatatagtc 1740 
taaggtcagc caaagcccct taagaaggta 1800 
ctgtgaagtg tggtgatgca tctaacacct 1860 
cttacaagat tacactaacc atttttaaag 1920 
aaaattacat attttataat gccctgcaac 1980 
gtatggtatt ttaagattat caatcacttc 2040 
tttctcaaga gctgtcagtc ccacatacag 2100 
gtaatcatgt gtcaggacat aaattttgtc 2160 
attgcgtgaa ctctgttaaa actgattatc 2220 
aataacccct ttcaatatac ctggttttta 2280 
tattggaaaa tgtcatgaag tcatttttca 2340 
ccttttgtgt tttatctttt gtttttccat 2400 
aaaatcaaat ttcctgggtg tgtccttgtt 2460 
attttagatg gaaaaataaa aaaaattaaa 2520 
atgatctggc attatctatt tcacagtaat 2580 
tcaccactcc cattgctgat agtgacatag 2640 
acagtggata ctgctaatga gtcatctttt 2700 
tcctctttaa tttttgcaca aaatcctact 2760 
gaggaaaatc acttgctaaa tgtgctctag 2820 

2829 



<210> 15 

<211> 1589 

<212> DNA 

<213> Homo sapiens 

<220> - 
<223> 2751730 

<400> 15 

cccggctgcc agcaccatgt ccgccagggg 
gttggtgttc ttgggggagc agagcgtcgg 
cgacagcttc gacaacacat accaggcaac 
gtacttggag gaccgcacgg tgcgactgca 
ccgcagcctg atccccagct acatccggga 
cacaaatctc aactccttcc aacagacctc 
gggcagtgat gttatcatca tgctggtggg 
gataaccatc gaggaggggg agcagcgcgc 



agattttggg aatccactga gaaaattcaa 60 
gaagacgtct ctgattacga ggttcatgta 120 
cattgggatt gacttcttgt caaaaaccat 180 
gctctgggac acagctggtc aggagaggtt 240 
ctccacggtg gctgtggtgg tgtacgacat 300 
taagtggatc gacgacgtca ggacagagag 360 
caacaagacg gacctggctg ataagaggca 420 
caaagaactg agcgtcatgt tcattgagac 480 



15 



WO 99/41373 



PCT/US99/02527 



cagtgcgaag actggctaca acgtgaagca 
cggaatggag aatgtccagg agaaaagcaa 
accccaggag cccccggcca gcgagggcgg 
gcttcccatg acactccttg cttgttgtgt 
gagggaaccg agttatcaag atgggaggat 
gtgggatggg gagggaggct gggcatcagg 
aacaactatt ttttggtttg gttgtaatat 
ttaaaattta aaaaaaattt aaatcatgtg 
tcatttttat ttgaaagatg tgctttttgt 
ttttccacac acctcctcct tcctgttctc 
ctatccccag cccaataaat taaaacaatt 
gggccatctg gccccactct ctaggcccta 
atcagcctct gaaaaggtag ctctgactct 
cctcgtgtgg agctgagacc aaggtctgtg 
aggaactgct cccacctctc cttctctttc 
caggcaacct tgtgttctct gtgccctcag 
acgaggtgct gttggcaatt gcagctgcgt 
tgtacttctg aaaatatttt aaccagccca 
gtctaagaaa ctcagtgtgt gatgttggg 



gctttttcga cgtgtggcgt cggctctacc 540 
agaagggatg attgacatca agctggacaa 600 
ctgctcctgc taatgcagag ccgacctgtg 660 
tgcttcctat tggctagctt cctaaggggg 720 
ttttcttttc tctctgtctt taggagtagg 780 
gatcacatca ctcttaacgg ctgttactta 840 
attgtacttt attaagattg ccaaaaactg 900 
tatacaattt tttgccagat aaaaatgtag 960 
ttttgtatat ttgtaaactt atagagaacc 1020 
tttgaaccat tcatcacctc tgccttcctc 1080 
aactgagcaa attaattagg cttcagtctg 1140 
ctccagttaa atcaaacatt ggttgaacac 1200 
tgtctttcca tggccagagt gggtcgtcaa 1260 
ggctctaggc agcccggcag cagcccccag 1320 
cctgaaaagc cttgaagggg gttgagggta 1380 
ggactggcag agcaagaggc catcagaagg 1440 
ggtgcgtggt ttatagactg gggtgccatg 1500 
cccagcagtc agtacacatg cctgggccaa 1560 

1589 



<210> 16 
<211> 2533 
<212> DNA 

<213> Homo sapiens 

<220> - 
<223> 2794975 

<400> 16 

cacagcgact gcgcggacgg gttcctgagt 
ggagcgagtt ccccgaaatt gacggacaag 
gcggcagcgc ggacttcgag tctgtggatg 
aagaggtgtc cggggacagc aaggatgacg 
acaacactct gcgtctggct gagccacaaa 
cccctatcca gttgtcaaag ataacggaga 
tgctaaagag ggaacagtcc tcgacagtga 
gacttaaggc aaagcaggag aagcgctcag 
tagtcttaga agaggcatca gccagccagg 
catctggcaa gaacaaatcc tatgatgtgc 
atagagtact. gctggctgga gcggatgtga 
tggggcggaa tgggttgggg aagacaacgt 
gggttccagc ccacatttcc ctgctgcacg 
ctgccctgca gagtgtgctg gagagtgaca 
gggagctcac tgcccagatt gctgctggca 
cagaaatcta tgccaaactg gaggagattg 
tcattctcgc tgggcttggc tttaccccta 
caggtggctg gaggatgagg ctggccctgg 
tgctgttaga tgaacctaca aacatgctgg 
acctgcagac gtggccctcc accatcctag 
ccatcgccac agacatcatc cacctgcaca 
ttgagacctt catcaagagt aagcaggagc 
cgcagcagca gtatcgccag cacatccagg 
acagggcctc tcaagtgcag agtaaactca 
ctgtggacaa ggaatcagag gtcgtaatga 
cgccaattct gcagctagat gaggtggatt 
gtcgcctctc tgtgtctgct gatctcgagt 
ctgggaagcc taccatgctg aagctgcttt 



ggaacatggc gacttgcgcc gaaatcctgc 60 
tcttcgacta cgtgaccggc gtcttgcaca 120 
acctggtgga agctgtaggg gaactattgc 180 
cgggcatcag ggccgtgtgc cagcgcatgt 240 
gccagggaaa tagccaggtg ctactggacg 300 
actacgactg tggaaccaaa cttccaggac 360 
atgcaaagaa gttagagaag gccgaggctc 420 
agaaggacac gctcaagacc agcaaccctc 480 
caggcagcag aaaggagagt cggttggaat 540 
gaattgagaa ctttgatgtg tcttttggcg 600 
acctggcatg gggccgccgt tacgggctgg 660 
tactgaagat gctggccacc cggagtctgc 720 
ttgagcaaga ggttgctgga gatgacactc 780 
gtgtgcgaga ggatttgcta cggagggagc 840 
gggcggaggg ctctgaagct gcagagctgg 900 
aggctgacaa ggcacctgcc agggcatcag 960 
aaatgcagca gcagcccacc cgggagttct 1020 
cccgggccct ctttgctagg ccagatcttc 1080 
atgtcagggc catcctgtgg ctggagaatt 1140 
tcgtctccca cgaccgcaac ttcttgaatg 1200 
gccagcggct agatggttac cggggagact 1260 
ggctgctcaa ccagcagcgt gaatatgagg 1320 
ttttcattga ccggtttcgc tacaatgcca 1380 
agatgctgga gaagctgcct gagctgaagc 1440 
agttccctga tgggtttgag aagttctcgc 1500 
tctactacga tccgaagcac gtcatcttca 1560 
ctcgcatctg tgtggttgga gagaatgggg 1620 
tgggggacct ggcacctgtt cggggcatca 1680 
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gacacgctca caggaatctg aagattggct 
acctaaacgt cagtgctgtg gaactgctgg 
agtaccgtca ccagctgggt cggtatggca 
ccagcctgtc tgggggccag aagagccgag 
ccaacttcta cattctggat gaacccacaa 
tgggccgtgc cctcaacaat ttcaggggtg 
ttatcaggct ggtgtgccgg gagttgtggg 
aaggaggatt tgaccagtac cgcgccctcc 
tctagggcca ccaggctgag gactcgccca 
ccaccatgta ggccaccact ccaggccgtg 
cccaaatgtc tctatccttt tgactggagc 
agggttggtg aggactggtc tcccgggggt 
agattccccc actgccccag ctctgactgg 
tctccccgcg tcaaaaaaaa aaaaaaaaaa 
aaaaaaaaaa aag 



atttcagcca gcaccatgtg gagcagctgg 1740 
cacgcaagtt tcctgggcgg cctgaggagg 1800 
tctccggaga actggccatg cgtcctcttg 1860 
tggcctttgc tcagatgact atgccctgcc 1920 
accacctgga catggagacc attgaggctc 1980 
gtgtgattct ggtgtcccac gatgagcgct 2040 
tatgcgaagg aggcggcgtc acccgtgtgg 2100 
tccaggaaca gttccgccgc gaaggcttcc 2160 
ggacatggac tggtctctca gacccctggg 2220 
gacttccccc aacttgggga cagccttatt 2280 
atcttctgca caaccttggg agcccatcca 2340 
gggggtctgg ggggtaccct ctggggttat 2400 
accccaagtg gctgctatgt aaattaaatc 2460 
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 2520 

2533 



<210> 17 
<211> 4312 
<212> DNA 

<213> Homo sapiens 

<220> - 

<223> 2797710 

<400> 17 

cttcgagccg ccacgtaatg ccacgtcccc 
ctgtttccgg gcttagaggg ctggagtggc 
aggagtgtgt agagtgcggg attgggggcc 
tctttttttt ccggaggggc cggtaaacct 
gcggggtaat ggggggtcag agtgccggac 
agctttgtga cagagtagct tcatctactt 
ctctcaaatc attatctaag aaataccgct 
ttattcatgt tttacaaaca gatcgttcag 
cactatataa tataatatct aatgaagaag 
agagtgaaga tttgggaagc caatttacag 
ctcttctgtt atctttattg gaggagtttg 
ttcttacttc tcttttaaaa caactagggc 
ctatgggtgt ttcaagattg atggacttac 
atggcgtctt actactgcag gcactaacaa 
cttttgaaaa tgctttcgag agactactgg 
gaggtatagt agttgaagat tgtttgattt 
ccaatcgaaa tttttttaaa gaaggctcat 
ttggagatga aaattctggc tggtctgcac 
agcttgttcg tgtattggta tctcccacca 
aggctatgtt ccagtgtggg ttattgcagc 
ttcctgctga tatcctgact gagaccataa 
aagtaaacca agactacttt gcatctgtaa 
ttgtagtact tctcatgtcc atggttaatg 
ttctctattg tttccagtgt ttcttgtata 
caacactttt accttctacc attgatgcaa 
tatgtggagg tttgttttct actgattcac 
cccatgcgtt gcaagaaaat gccacccaga 
caagtattgg caaccctcca gtttctttac 
gaagcaaaat acaaacaaga gttggattat 
gtcccattgc agtaacgcat tttcttcaca 
aaattgcaga aaatcttgga gaagaagagc 
tgggcatttc gatttatttc aatgataact 



gcgcatgcgc atcttggccg ctgctggcgg 60 
cgccgagttg gaggcggtgg tggcagcagt 120 
aggccctgcg gagggcgggg gaagttgtct 180 
ggtggctgaa cggcaagatg aatttcctcc 240 
cccagcacac agaagccgag acgattcaaa 300 
tattggatga tcgaagaaat gctgttcgtg 360 
tggaagtggg tatacaagct atggaacatc 420 
attctgaaat aataggttat gctttggaca 480 
aagaagaagt agaagaaaat tccacaagac 540 
aaattttcat taagcagcag gaaaatgtca 600 
atttccatgt ccgctggcct ggtgtgaagc 660 
ctcaggtgca acaaattatt ttagtcagtc 720 
tagcggattc cagggaagtt atacgtaatg 780 
gaagcaatgg tgcaatccag aaaattgttg 840 
acattatttc agaggagggg aacagtgatg 900 
tgctccaaaa cttattaaaa aacaacaact 960 
atattcaacg tatgaaacct tggtttgaag 1020 
agaaagtgac caatctacat ctaatgctac 1080 
accctcctgg tgctaccagt agctgccaga 1140 
agctttgtac tatcctaatg gctactgggg 1200 
atactgtatc agaagttatt cgaggctgcc 1260 
atgcaccttc aaacccacca agaccggcaa 1320 
aaaggcagcc atttgttttg cgctgtgctg 1380 
aaaaccaaaa aggacaagga gaaatcgtgt 1440 
caggtaattc agtttcagct ggccagttat 1500 
tttcaaactg gtgtgctgct gtggcccttg 1560 
aagaacagtt gctcagggtt caacttgcta 1620 
ttcaacagtg caccaatatt ctttcacagg 1680 
taatgttgct ttgtacctgg ctaagcaatt 1740 
attcagccaa tgttccattt cttacaggac 1800 
agttggtcca aggcttatgt gcccttttgt 1860 
cacttgagag ctacatgaaa gagaagctaa 1920 
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aacaactgat tgagaagagg attggcaaag agaatttcat agagaaacta ggatttatta 1980 
gcaaacatga gttgtattcc agagcatctc agaaacccca gccaaacttt cccagtccag 2040 
aatacatgat atttgatcat gagtttacga agctggtaaa agaacttgaa ggtgttataa 2100 
ctaaggctat ttataagtcc agtgaagaag ataaaaaaga agaagaggtg aaaaaaacat 2160 
tagaacagca tgacaatatt gtgactcact acaaaaatat gattcgagag caggatctcc 2220 
aacttgagga attaaggcag caggtttcta cattaaaatg tcaaaatgaa cagctccaga 2280 
cggcagtcac acagcaagta tcacagatcc agcagcacaa agaccagtat aatcttctta 2340 
aaatacagct aggaaaagac aatcagcatc aaggttctta cagtgagggg gctcagatga 2400 
atggcattca gccagaagaa attggtagat tgcgagaaga gatagaagaa ttaaaacgta 2460 
atcaggaact tttacaaagc cagctgactg aaaaggactc tatgattgaa aatatgaaat 2520 
cttcccaaac atctggcaca aatgaacagt cttcagcaat agtttcagct agagattctg 2580 
aacaagttgc agaattaaaa caggaactgg caactttaaa gtctcagtta aactcacaat 2640 
ctgtggagat caccaaacta cagacagaaa agcaggaact gttacagaaa acagaagcgt 2700 
ttgcaaaatc agttgaggta caaggagaga ccgagactat aatagccacc aaaactactg 2760 
atgtagaagg aagactgtca gcattattac aagagaccaa agagttaaag aatgaaatta 2820 
aagctctgtc tgaggaaaga actgccatta aagagcagct ggattcatct aatagtacca 2880 
ttgccatttt acaaactgag aaagacaaac tagagttgga aattacagat tctaaaaaag 2940 
aacaagatga tctcttggtg ctcttggccg atcaagatca gaaaatactg tcattgaaga 3000 
ataaactcaa ggatcttggt catccagttg aagaagagga tgaacttgaa tctggagacc 3060 
aagaggatga ggatgatgaa agtgaagatc ctggcaagga tctagatcat atctagtttt 3120 
caaatctcta ggaacaatag agattatatc ataactctga agatggactc taaactttgg 3180 
tttacctcca tggaaaataa agatttagaa accaggtgga aaacattcac tattaagact 3240 
attgatatat ttttgtaatg ttgccaccca tgttgaaaac cttaaaactg aatttatgta 3300 
gaacacacat cttttattta aatccatctg aactgtccca aaatgtaatt tgcaaagagc 3360 
ttcaaacacc aaaaatatac cattttaagg catgtggtga caatgctatt tgccttagtt 3420 
cattcgttgt ttactttgca aaatttgtga cttttaagta ctaacacatt tgattgaatg 3480 
tgtattgaat tggaattatt atttctaaat tcagcatttg agtccttagt tgatgatttt 3540 
cttttaaaga aattagttta tttaaaagca agatagcctg ggttgggagt tag J cgcatat 3600 
catcagccct ccatctaatg tgattgctaa gatgtgcagg utaggacagc ttagaatgag 3660 
tgccttgaca cctacatatg gctgcattcc atacctcagt gaagaagccg aaggatgctg 3720 
acacacagtg atgtggaggg cattccggag aaacttttgc aacagtgtat taatgtgatt 3780 
ggttttcttt ttgttttaag ctaaacatct aaaacaaatt tcaagttaac agttcataca 3840 
actcagattt agtattattg tttatcataa aacatattat gttcctagtt ttaaaaagac 3900 
ccaaagtatg attaatataa tttattggca tttttgctga ataggtttaa gtcagataat 3960 
agattttaaa aaagcaaatg aggcaatgtg tcaaaattta atgtttcata ataaaaatag 4020 
atacgttgac ttaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 4080 
aaaaaaaaaa aaaaaaaaaa aagggggggc cgctctaggg gtcccaggtt taggtagggg 4140 
tgaatgggag gtcaaggccc ttctaagggg tcccctaatt tcatttcacg ggcggcggtt 4200 
ttaaaaggcc gtgactggga aacccctggg gttacccaat ttaatcgcct tggggaaaac 4260 
ccccttttgg caagcgggcg taatagcgaa ggggcccgca cggttcgcct tt 4312 



<210> 18 

<211> 2370 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> unsure 
<222> 2354 

<223> a or g or c or t, unknown, or other 

<220> - 
<223> 2914719 

<400> 18 

gtttctaccg caggcttaag gaggcttcgg gctcctggga tttctgtccg cgctcctggc 60 
cctcgtcctt cgcgccagag caggttcgca aactcctcag acccttctgc tcccggccgc 120 
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cgctttccgc cggggcgaga cccccaggtt 
ttttggaacc agtgggacca gcatgtttgg 
gaaggatatt gaagtaacat catctcctga 
accaaccttg ccggggaact ttcttattgc 
ggaagttcaa gacagtggac agaccattcc 
gcttgatgtc tgctggagtg acgatgggag 
tgccaaaatg tgggacctca gcagtaacca 
tgttaaaacc atccattgga tcaaagctcc 
ggataagact ttaaagtttt gggatactcg 
ccctgaaagg tgttactgtg ctgacgtgat 
gaggggcctg attgtctatc agctagagaa 
tccactgaaa catcagcatc ggtgtgtggc 
tggttttgcc ctgggaagta tcgaggggag 
ccccgccaaa gataacttca cctttaaatg 
tcctcaggac atttatgcgg taaatggaat 
aactgtggga tctgatggta gattcagctt 
aacttcggaa cagttagatc agcccatctc 
atttgcatac gcttccagct acgactggtc 
aaaaaattac attttcctgc gtaatgcagc 
gtggctggag actctggctc agccagagtt 
gaatttgggt cccagccttg ttgggttgtc 
aaaacgatgt cattgttcag cagctgagag 
ccattccact gcctgttgca gagtttttct 
gttagattgc gggcaccgcc agggattttg 
tggaaaagcg tctgtgagcc ccgtgctgta 
cccgagcttc ctttgtcctt ttctcccctt 
ctgcccagca tcagcgtggg gttttgagtt 
caaccttagg ggaaagggag tccccagctg 
tctgtgagcc ctagtgggca gagtttgaat 
ttcaaaatcc ccagtgcttt ctggccttgc 
actgttactt gtgctgggtg acaggccatt 
gacacttgcg tgactttcaa gttagaaccc 
cttacccacc ccggtcccct gcatgggaat 
ctgttctttt gtattttgtg taatttaaaa 
tcagctgggc tgagaaaaag tgtgactttt 
ctcaaatgct tcttaggcct ttctgtttgg 
taaaattcta gtatcgatag tgttgtaaga 
aataaaattt catnctgcaa tgttagttgg 



caaaatgagc ctgtttggaa caacctcagg 180 
cagtgcaact acagacaatc acaatcccat 240 
tgatagcatt ggttgtctgt cttttagccc 300 
aggatcatgg gctaatgatg ttcgctgctg 360 
aaaagcccag cagatgcaca ctgggcctgt 420 
caaagtgttt acggcatcgt gtgataaaac 480 
agcgatacag atcgcacagc atgatgctcc 540 
aaactacagc tgtgtgatga ctgggagctg 600 
atcgtcaaat cctatgatgg ttttgcaact 660 
ataccccatg gctgtggtgg caactgcaga 720 
tcaaccttct gaattcagga ggatagaatc 780 
tatttttaaa gacaaacaga acaagcctac 840 
agttgctatt cactatatca accccccgaa 900 
tcatcgatct aatggaacca acacttcagc 960 
cgcgttccat cctgttcatg gcacccttgc 1020 
ctgggacaaa gatgccagaa caaaactaaa 1080 
agcttgctgt ttcaatcaca atggaaacat 1140 
aaagggacat gaattttata atccccagaa 1200 
cgaagagcta aagcccagga ataagaagta 1260 
gtttctctcc actctgcctc atctctgtac 1320 
agccatggac atggatttca acccctggag 1380 
cccaggcgtc cgcggcgact tgccgtctct 1440 
gtaactaagg gggttgaggt tattgtagac 1500 
cagcgcttca gtgtacgtgt tagagaatat 1560 
ttttgtaata aagtcttttg cagattgctt 1620 
gcccaccccg taacctcagg aacatgcgtc 1680 
gagatttcag acaccctctg ggaaatgcgg 1740 
cgctcacttc tgctgcgtgg aacggcagcc 1800 
gtgtttttcc ttgcttccct cattcccatc 1860 
tgctcagatt tccgagtgac tcaaatgggg 1920 
tgtgggtaac ctcctaaggc ccaagtgggt 1980 
aagccccctg catgggaatt *gccctgaact 2040 
tgccctgaac cctcagtagt ggttgtttgg 2100 
ttcctggttg gtaagtattc aaaatgaaat 2160 
gggtctgtca ctgtatttct cacctgtgat 2220 
actaatgtgt gaagtctgac ttgctgagtg 2280 
tgtgtttgcc tactcataaa aaacaatgaa 2340 

2370 
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